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PREFACE 


This book is intended to serve as an introduction to the technology of 
thermal imaging, and as a compendium of the conventions which form the 
basis of current FLIR practice. Those topics in thermal imaging which are 
covered adequately elsewhere are not treated here, so there is no discussion 
of detectors, cryogenic coolers, circuit design, or video displays. Useful infor- 
mation which is not readily available because of obscure publication is 
referenced as originating from personal communications. 

Virtually everyone with whom I have worked in the thermal imaging 
business has contributed to the book through the effects of conversations 
and ideas. I gratefully proffer blanket appreciation to all those who have 
helped in that way to make this book possible. The contributions of five 
people, however, bear special mention: Bob Sendall, Luke Biberman, Pete 
Laakmann, George Hopper, and Norm Stetson. They, more than any others, 
have positively influenced my thinking. 

The many supervisors I have had in three years of writing this book 
have generously supported my efforts. Among them, I especially wish to 
thank Ed Sheehan and Don Looft of the Army’s Night Vision Laboratory, 
who originally supported the work, and Bob Norling, Bob Rynearson, and 
Sheldon Busansky of Honeywell who continued that support. This book 
was ultimately made possible by its inclusion in the Optical Physics and 
Engineering Series. For that, I wish to thank Bill Wolfe, the Series editor, 
His confidence, patience, and criticisms are much appreciated. I also wish to 
recognize and to thank Miss Beth Whittemore of Honeywell who typed 
numerous drafts of the manuscript, and Miss Gail Logan of Frank Thompson 
Associates, who supervised the preparation of the final manuscript. Finally, 
my profound appreciation goes to LaVonne, my editor-at-home, whose 
editorial skills have clarified many passages. 

Readers are encouraged to contact me at Honeywell to transmit 
corrections, alternate views, and new sources of information. 


Mike Lloyd 
Acton, Massachusetts 
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CHAPTER ONE — INTRODUCTION 


1.1 The Function of Thermal Imaging Systems 


Thermal imaging systems extend our vision beyond the short-wavelength 
red into the far infrared by making visible the light naturally emitted by 
warm objects. Because of the eye’s lack of response in the absence of 0.4 to 
0.7 um light, a device is needed which will image the dominant energy at 
night as the eye does during the day. This night eye must respond generally 
to the photons emitted at body temperature because these dominate when 
reflected solar radiation is absent. Specifically, it must have spectral response 
at wavelengths where significant emissivity, temperature, and reflectivity dif- 
ferences exist in the scene. This is necessary to assure that radiation patterns 
will exist which are sufficiently similar to the corresponding visual reflectivity 
patterns to make visual interpretation of the converted scene possible.* This 
spectral sensitivity must also coincide with an atmospheric transmission 
“window” which does not excessively absorb the desired radiation. 

The difficulty of producing thermal images may be appreciated by con- 
sidering the perfection with which the eye produces visible images. The eye 
is an optimized sensor of visible radiation in three respects. First, the eye's 
spectral response range of 0.4 to 0.7 um coincides with the peak of 
the sun's spectral output. Approximately 38 percent of the sun's radiant 
energy is concentratedt in this band, and terrestrial materials tend to have 
good reflectivities there. Second, the eye is an ideal quantum-noise-limited 
device because the retinal radiation detectors have low noise at the quantum 
energy levels in this band. Third, the response of these retinal detectors to the 
photons emitted at body temperature is negligible, so that this long wave- 
length thermal energy does not mask the response to the desired wavelengths. 
This optimization allows the eye to perform its primary functions, which are 


*We will be concerned solely with passive imaging systems which sense natural scene radiation, 
rather than with active systems which sense the reflected radiation from a thermal illuminator. 


+Compare this with the 8 to 14 ит band where only about 0.08 percent of the sun's energy is found. 
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the detection of reflectivity differences in objects illuminated by 0.4 to 
0.7 micrometer radiation, discernment of patterns in these reflectivity dif- 
ferences, and the association of these patterns with abstractions derived from 
previous visual and other sensory experiences. To be as effective as the 
human eye, the night thermal eye must also image the dominant light, be 
quantum-limited, and reject extraneous light. 

Whereas natural visual spectrum images are produced primarily by re- 
flection and by reflectivity differences, thermal images are produced pri- 
marily by self-emission and by emissivity differences. Thus in thermal 
imaging, we are interested in thermally self-generated energy patterns. 
Thermal system capabilities are usually described in terms of scene tempera- 
tures rather than in radiometric terms. Therefore some clarifying definitions 
are necessary. All of the scene temperature, reflectivity, and emissivity con- 
tributions taken together can be represented at any point in the scene by an 
effective temperature at that point. This is the temperature which would 
produce the measured irradiance near the point if the point were an ideal 
blackbody radiator; that is, if the point radiated the maximum theoretically 
possible power at the effective temperature. Similarly, the irradiance mea- 
sured through an intervening attenuating atmosphere may be thought of as 
being produced by an apparent temperature less than the effective 
temperature. 

This simplification is possible because most thermal imaging systems 
have broad spectral bandpasses and accept all polarizations. Consequently 
they are insensitive to the mechanisms which produce the effective tempera- 
ture differences. The variations in the effective temperature of a scene tend 
to correspond to the details in the visual scene, so a thermal imaging system 
provides a visible analog of the thermal scene, hopefully with an efficient 
transfer of useful information from one spectral regime to the other. 
Wormser! discusses the desirability of thermal imaging and describes how 
conventional thermal imagers work. 

This book is concerned primarily with those mechanically-scanning de- 
vices which convert radiation in the far infrared spectral region to visible 
radiation in real time and at an information update rate (or frame rate) com- 
parable to that of television. For lack of a more descriptive abbreviation such 
as TV for television or I? for image intensification, thermal imaging users 
have adopted the term FLIR, the acronym for Forward Looking Infra-Red. 
This term distinguishes fast framing thermal imaging systems from downward 
looking single channel thermal mapping systems, and from single-framing 
thermographic cameras. Although the term FLIR originally implied an air- 
borne system, it is now used to denote any fast-framing thermal imager. 

Thermal imaging in a FLIR is produced in the following way. An optical 
system collects, spectrally filters, and focuses the infrared scene radiation 
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onto an optically-scanned multi-element detector array. The detectors con- 
vert the optical signals into analog electrical signals which are then amplified 
and processed for display on a video monitor. FLIR is similar in external 
aspects to television in that the final image often appears on a TV-type 
monitor operating at TV-type frame rates. Contrast (video gain) and bright- 
ness (background level) controls are used, and spatial patterns in the dis- 
played thermal scene strongly resemble those of the corresponding visible 
scene. 

The major applications of real time thermal infrared imaging presently 
are for military and intelligence purposes. The armed forces have publicized? 
the fact that they have thermal rifle scopes, missile and gun sights, and air- 
borne FLIRs. Their functions include target acquisition, fire control, aerial 
navigation, surveillance, and intelligence gathering. Non-military applications 
of thermal imaging are limited, but growing. They include thermal pollution 
surveys, breast cancer detection and other medical diagnoses, forest fire pre- 
vention, air-sea rescue, manufacturing quality control, detection of fissures 
and loose material in coal mines, preventive maintenance inspections of elec- 
trical power equipment, and earth resources surveys. As the utility of thermal 
imagery becomes more widely recognized, the list is certain to grow. Some 
likely additions are commercial aircraft landing and taxying aids, floating 
debris avoidance systems for surface-effect vessels, crime prevention, and 
smoke-penetration aids for fire fighters. 

Thermal imagers are superior in performance to other types of passive- 
sensing electro-optical imaging devices when operability at any time of the 
day or night and under all weather conditions is the primary consideration. 
The major reasons for this superiority are the high contrast rendition which is 
achieved and the good atmospheric transmission windows which are available. 
Image intensifiers and low light level television systems rely largely on re- 
flectance differences between the target and the background for detection 
and recognition. In the visible spectral region and when using non-color 
devices, broadband reflectance differences between interesting targets and 
their backgrounds tend to be moderate, especially when camouflage is 
employed. 

Thermal imaging systems, however, usually suppress the average (dc or 
background) value of the scene radiance, so that only scene variations around 
the average are displayed. Thus high image contrasts are achieved, especially 
since the camouflaging of temperature differences is very difficult. We will 
see in Chapter Ten that an observer’s image interpretation performance 
strongly depends on image contrast so thermal systems in this respect are 
superior to most visible wavelength devices. 

A minor disadvantage is that thermal scenes usually have fewer shadows 
than visible scenes, so that some three-dimensional contour information is 
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lost. This is usually offset by the fact that all surfaces in the thermal scene 
radiate energy so that all forward surfaces tend to be visible. Another minor 
disadvantage is that thermal emitters are not necessarily good visual re- 
flectors, so that visually bright objects may be dark in thermal scenes, and 
vice versa. Nonetheless, considerable intelligence is usually available through 
thermal imagery. Some often-cited examples are storage tank fluid levels, 
ship wakes, evidence of recent operation of a vehicle, operation of an indus- 
trial plant revealed by hot gaseous emissions or heated areas, and object 
identifications by noting characteristic surface heating patterns. We will now 
briefly trace the history of thermal imaging systems. 


1.2 A History of Thermal Imaging 


The first thermal imaging system was the circa-1930 Evaporagraph, a 
rather insensitive non-scanning device described in Section 8.7. The 
Evaporagraph could not satisfy most thermal imaging tasks because of in- 
herent contrast, sensitivity, and response time limitations. Two alternative 
approaches to thermal imaging were evident in the 1940’s. One was to de- 
velop discrete-detector, mechanically-scanning analogues of television sys- 
tems. The other was to develop an infrared vidicon, or some other non- 
mechanically-scanning device. The first approach has been immensely 
successful, while the second has had only modest success to date. Since non- 
scanning devices have not yet matched the performance of scanning systems, 
we will defer discussion of them until Chapter Eight. 

The original scanning thermal imagers were called thermographs. They 
were single detector element, two-dimensional, slow framing scanners which 
recorded their imagery on photographic film and consequently were not real- 
time devices. In 1952 the Army built the first thermograph in this country 
using a 16-inch searchlight reflector, a dual axis scanner, and a bolometer 
detector. This event was followed in the period from 1956 to 1960 by rapid 
development of Army-supported military thermographs, which have since 
come to be used almost exclusively for civilian applications. 

Up to the late 1950’s, the development of fast-framing thermal imagers 
was infeasible because detectors with fast time response were not available. 
Electrical signal frequency bandwidths had to be limited to a few hundred 
Hertz because poor detector response above this range gave low image signal- 
to-noise ratios. The development of cooled short-time-constant indium 
antimonide (InSb) and mercury doped germanium (Ge:Hg) photo-detectors 
made possible the first fast framing sensors. 

The first real-time FLIR sensor was an outgrowth of downward looking 
strip mapper technology. Strip mappers are essentially thermographs with the 
vertical scan motion generated by the aircraft motion relative to the ground, 
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and were developed and used most extensively by the Army and the Air 
Force for reconnaissance. 

The first operating long wavelength FLIR was built at the University of 
Chicago in 1956 with Air Force support. It was a modified AN/AAS-3 strip 
mapper designated the XA-1. The modification added a nodding elevation 
mirror to the counter-rotating wedge scanner of the mapper so that the single 
detector element traced out a two dimensional raster pattern. Due to the lack 
of a pressing military need following the Korean war, this development was 
not pursued further. 

To the author’s knowledge, the next real-time long wavelength device 
was a ground-based FLIR built by the Perkin-Elmer Corporation for the 
Army in 1960. This was called the Prism Scanner because it used two ro- 
tating refractive prisms to generate a spiral scan for its single element InSb de- 
tector. It had a 5-degree circular field with 1 milliradian detector subtense, a 
frame rate of 0.2 frame per second, a thermal sensitivity of about 1°C, and 
a long-persistence phosphor CRT display. The Prism Scanner launched the 
continuing development of ground based sensors for the Army and for civil- 
ian applications which still has not exhausted the possibilities for compact- 
ness and civil benefits. 

The airborne FLIR concept was resurrected in two independent pro- 
grams in the early 1960’s by the Air Force and Texas Instruments Inc., and by 
the Navy and Hughes Aircraft Company. Prototypes were completed and 
flight tested in 1965, and were so successful that they spawned an amazing 
proliferation of airborne FLIRs and applications. From that point on, the 
FLIR business burgeoned, and between 1960 and 1974 at least sixty dif- 
ferent FLIRs were developed and several hundred were produced. Surface 
and airborne FLIRs have come to be functionally similar and in many cases 
identical so that the term FLIR now properly connotes any real-time thermal 
imager. 

The FLIR technology reached maturity with the development of de- 
tector arrays which perform at or near theoretical thermal sensitivity limits 
while providing signal response fast enough to sustain the wide bandwidths 
necessary for high ratios of field-of-view to resolution. Compact and efficient 
cryogenic coolers and detector dewars made possible FLIRs with reasonable 
power consumptions, and continuing advances in electronic component 
packaging have further reduced signal processing volume and power con- 
sumption. FLIR technology is rapidly evolving toward the small size, low 
power, and moderate cost necessary to make FLIR a common electro- 
optical imaging device. 

When FLIR systems were in their infancy, the analysis and evaluation 
of thermal image quality were as tentative as the systems themselves. Per- 
formance was indicated by a spatial two-point resolving power, and by the 
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irradiance input which produced a unity signal-to-noise ratio output from 
the sensor. These were usually measured at the pre-amplified detector output, 
not at the visual analog display. The system “resolution”? and “thermal 
sensitivity" were dominated by then-poor detector performance, so image 
quality probably correlated reasonably well with the pre-amplifier 
measurements. 

However, as the technology advanced and low-noise, high-resolution 
components were developed, image quality did not improve accordingly. 
The reason is that the lessons of similar electro-optical technologies were not 
applied. The subsequent efflorescence of interest in image quality and the 
application to FLIR of ideas from other technologies form much of the sub- 
ject matter of this book. We shall see that there is much to be learned from 
modern optics, photography, television, and the psychophysics of vision. 
They suggest the fundamental image quality parameters which are applicable 
to FLIR, and which foster the analyses which made possible the television- 
like quality of current FLIR systems. 

Thermal imaging is a fascinating but demanding technology. A great 
deal of the satisfaction of working in it derives from synthesizing one’s own 
viewpoint from the seven basic disciplines involved. These disciplines are: 


€ radiation theory and target signatures, 

atmospheric transmission of thermal radiation, 

optical design, 

detector and detector cooler operation, 

electronic signal processing, 

video displays, and 

human search processes and visual perception using FLIR. 


The following section introduces the minimum number of thermal 
imaging concepts necessary to provide an understanding of the more de- 
tailed information in later chapters. 


1.3 Thermal Imaging Fundamentals 


The basic elements of any thermal imaging problem are described in the 
blocks of the sequential process of Figure 1.1. These elements are obvious, 
but failure to adequately analyze their effects in a given problem can lead to 
serious design errors, so we will dwell on them briefly. 
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To be detected, and subsequently recognized and identified, an object 
must produce an object-to-background apparent temperature difference of 
sufficient magnitude to distinguish it from other variations in the back- 
ground. The intervening atmosphere must not excessively blur or attenuate 
this signal. The operator must use an effective search procedure, know what 
to look for, and be able to point his sensor in the proper direction. The sen- 
sor must collect the radiant signal with an optic, and convert it to an elec- 
trical signal with good signal-to-noise ratio in a detector operating in an 
appropriate spectral band. This electrical signal must then be reconverted to 
an optical signal on a video display. 

Finally, the operator must be able to optimize his visual information 
extraction capability by using video gain and brightness controls. This entire 
process of conversion of an infrared scene to an analog visible scene must be 
performed so that contours, orientations, contrasts, and details are preserved 
or enhanced, and without introducing artifacts and excessive noise. The ob- 
server must be experienced and motivated, should have “‘good eyes’’, and 
should not be distracted from his visual search task. 

"А conventional scanning thermal imaging system is depicted in the block 
diagram of Figure 1.2, which shows the basic functions which are incor- 
porated into thermal imagers as a class. Particular systems may combine 
some functions and eliminate others, and there are many different specific 
implementations of these functions. Figure 1.3 shows a simple schematic of 
one possible implementation of a scanning FLIR. The optical system collects, 
spectrally filters, spatially filters, and focusses the radiation pattern from the 
scene onto a focal plane containing a single small detector element. An opto- 
mechanical scanner consisting of a set of two scanning mirrors, one sweeping 
vertically and the other horizontally, is interposed between the optical sys- 
tem and the detector. The ray bundle reaching the detector from the object 
moves as the mirrors move, tracing out a TV-like raster in object space as 
shown in Figure 1.3. This process of detecting the scene sequentially is 
called scene dissection. 

The incident electromagnetic field from the scene produces a dis- 
turbance within the detector which is usually proportional to the energy 
transported by the field. The type of disturbance depends on the type of 
detector, and may be an induced charge separation, an induced current, or a 
resistance change. Usually the incident radiation produces a voltage, and the 
two-dimensional object radiance pattern is converted into a one-dimensional 
analog voltage train by the scanning process. Àn infrared detector under a 
particular set of operating conditions is characterized by two performance 
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measures: the responsivity R and the specific detectivity D*. The respon- 
sivity is the gain of the detector expressed in volts of output signal per watt 
of input signal. The specific detectivity is the detector output signal-to- 
noise ratio for one watt of input signal, normalized to a unit sensitive 
detector area and a unit electrical bandwidth. 

These two measures are functions of the following parameters: 


electrical frequency: f[ Hz] 
optical wavelength: Alum] 
detector area: Aq ст? ] 
detector noise measurement bandwidth: Af[Hz] 
detector rms noise voltage measured in 

the bandwidth Af: Ма [volts] 
detector signal voltage as a function of 

wavelength and electrical frequency: МАО [volts] 


distribution of irradiance on the detector 
with wavelength: Њ [watt/cm? um]. 


Responsivity and specific detectivity are given? by: 


У, О) 
ROD = FAG” (1.1) 
and 
Ag AD)? V,(f) КО, (Ag Af)!? 
prap Ач Аб УУ _ ROD (Ag AD 


Ну Aq Ул V, | (1.2) 


If the device views objects against а terrestrial background, the radiation 
contrasts are very small, as will be demonstrated in Chapter Two. The corres- 
ponding analog voltage contrasts are thus also small, so that if these voltages 
are converted linearly into analog visual optical signals, observers will have 
difficulty recognizing and detecting objects because of the low image con- 
trast. Contrast commonly is increased by ac coupling the detector signal to 
the pre-amplifier. This is an approximate method for removing the back- 
ground signal. The simplest possible ac-coupling circuit is shown in 
Figure 1.4. This circuit blocks the dc component of the signal so that only 
variations from the scene average are passed. 
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C 
DETECTOR SIGNAL INPUT R OUTPUT 


Figure 1.4 An ac coupling circuit. 


The display scanning must be perfectly synchronized with the scene 
scanning, so both the display and the scan mirrors are driven by a scan 
synchronizer. For example, with a cathode ray tube display the scan- 
synchronization signals drive the vertical and horizontal beam deflection 
circuits. 

Any scanning thermal imager may be defined in terms of a few basic 
parameters, as follows. An individual detector commonly has a sensitive area 
which is rectangular with horizontal dimension “а” and vertical dimension 
“b”? in centimeters. The projection of the detector dimensions through an 
optic with effective focal length* f in cm gives detector angular subtenses, or 
projections, of о and 8 defined for small detectors by a=a/f and В = b/f 
radians. Angular subtense is usually expressed in units of milliradians 
(10-3 radian). The angular dimensions in object space within which objects 
are imaged is called the field of view of the sensor. FLIRs conventionally 
have rectangular fields of view with dimensions A degrees wide and B degrees 
high. 

The frame time Тұ is the time in seconds between the instant the sys- 
tem scans the first point of the field of view, completes the scanning of every 
other point in the field, and returns to the first point. The rate at which com- 
plete pictures are produced by the system is called the frame rate F, Е- 1/Т;. 

The detector or detectors in the system do not produce informative 
video during the entire frame period Tf, because some time is required for 
performance of functions such as automatic gain adjustment, dc restora- 
tion, and scan mechanism retrace. The correspondence between active scan 
time and spatial coverage is demonstrated by Figure 1.5. The large rectangle 
in Figure 1.5 represents the spatial coverage which could be obtained by a 
scanner if all of the frame time Тү were used for scene coverage. The solidly- 
crosshatched areas represent the scan coverage lost if time is required for 
vertical scan retrace. In that case the system is said to have a vertical scan 
duty cycle n, of less than unity. The dashed-crosshatched areas represent 


*The distinction between fas focal length and fas frequency will be clear from the context. 
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coverage lost due to horizontal scan retrace or needed for automatic equaliza- 
tion functions. This is described by a horizontal scan duty cycle ny. The 
inner blank rectangular area is the active scan area, and the time period 
corresponding to it is "уты Гр. The overall scan duty cycle nsc is the product 


Ny Ny - 


SPATIAL COVERAGE 
SPATIAL COVERAGE LOST POSSIBLE IF ALL 


BECAUSE OF VERTICAL AVAILABLE TIME Т; 
SCAN RETRACE IS USED FOR 


SCANNING 


SPATIAL COVERAGE 

LOST BECAUSE OF 
HORIZONTAL SCAN 
RETRACE OR PER- 
FORMANCE OF IMAGE 
EQUALIZATION FUNCTIONS 


LL ll b Ll llli, 


SPATIAL COVERAGE AVAILABLE IN TIME T; Tie 


Figure 1.5 Scan efficiency and spatial coverage. 


Another useful number which characterizes the scan action is the de- 
tector dwelltime, rq, in seconds. The dwelltime may be understood by 
imagining that a geometrical point source of thermal energy is placed in the 
path of a detector scan. The time between the instant the leading edge of the 
detector intersects that point and the instant the trailing edge leaves the point 
is called the dwelltime. Alternatively, one may think of the dwelltime 
as the time required to scan through one detector angular subtense o. If the 
angular scan rate in object space is constant, the dwelltime will be the same 
throughout the field of view; otherwise the dwelltime will be a function of 
the position in the field of view. 

For example, if the scan is linear, a single element system with unity 
scan duty cycle and contiguous scanning lines has a constant dwelltime of 
та = аВ/АВ F. If the scan duty cycle is not unity, 


ЕСІ 
Tq “АВ E . (1.3) 
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Figure 1.6 illustrates these concepts. The subsequent chapters of this book 
analyze the elements of the thermal imaging process of Figure 1.1, and 
present elaborations on the simple system discussed above. 


DETECTOR DETECTOR 
ANGULAR ANGULAR 
POSITION POSITION 

AT TIMEt || AT TIMEt +74 


B DEGREES 


SCAN DIRECTIONS 


eee 


FLIR SENSOR 


Figure 1.6 Dwelltime and detector angular subtense. 


The state-of-the-art in component technology changes so rapidly that it 
would not be useful to discuss specific component implementations here. 
Rather the intent of this book is to serve as an introduction for the unini- 
tiated reader so that he may go forth armed with the answers to those 
embarrassingly simple questions no one likes to ask, such as “What defines 
the field of view?" or "Why is dc restoration desirable?" Detector, cooler, 
and video monitor data are readily available from manufacturers and have 
been covered adequately elsewhere, so we will concentrate on those aspects 
of thermal imaging not covered in the standard texts, the topics of optics, 
scanners, and system analysis. 


1.4 Sources of Thermal Imaging Information 


Most of the literature of thermal imaging systems relates to military 
developments and is classified or has limited accessibility. Articles in the 
open literature concerning thermal imaging, visual psychophysics, or ob- 
server performance using aided vision are usually found in the following 
journals: 

Journal of the Optical Society of America, 

Applied Optics, 

Optica Acta, 
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Proceedings of the Institute of Electrical and Electronic Engineers 
(IEEE), 

IEEE Journal of Quantum Electronics, 

IEEE Transactions on Electron Devices, 

Journal of the Society of Motion Picture and Television Engineers, 

Human Factors, 

Photographic Science and Engineering, 

Vision Research, 

Soviet Journal of Optical Technology, 

Aviation Week and Space Technology, 

Infrared Physics, 

Optical Spectra, 

Optical Engineering, 

and Electro-Optical System Design. 


The fundamentals of general infrared technology may be found in the 
texts by Hudson?, Jamieson, et al^, Wolfe?, Kruse, et al.°, Hadni’, 
Bramson? , Smith, et al.? , and Holter, et а1.19. 


1.5 The Example System 


For the sake of clarity, a single example system will be used in subse- 
quent chapters to demonstrate the proper application of the principles 
derived. The example system will be a purely hypothetical device of a type 
which enjoys considerable attention and use, a FLIR consisting of a parallel- 
beam scanner which is adaptable by telescope changes to a variety of applica- 
tions. This type of FLIR is represented by the “Universal Viewer” described 
by Daly, et al.! ! , and by the "Thermal Imaging System” of Laakmann!?. 
For our example we will hypothesize a FLIR operating in the 8 to 12 um 
spectral band, and having a 4:3 aspect ratio field of view and a 30 frames 
per second, 60 fields per second scan format. The fundamental principles of 
the system are illustrated in Figure 1.7. For the present we will not specify 
the nature of the telescope, the scan mechanism, the detector lens, the cold 
shield, or the detector array. We will specify only the following minimum 
necessary parameters: 


detector array individual element size: a = b = 0.005 cm 
detector lens focal length: fy = 5 cm 

detector lens clear aperture: Dg = 2 cm 

number of scan lines: 300 

number of dwelltimes per scan line: 400 

interlace: 2101. 
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These parameters further imply a field of view of 400 mrad by 300 mrad 
(22.9 x 17.2 degrees) and a detector angular subtense of 1 x 1 mrad for the 
basic scanner and detector lens without a telescope. In subsequent chapters 
we will consider alternative implementations of the unspecified features of 
this example system, such as array configurations, telescope types, scan 
mechanisms, signal processing schemes, and performance tradeoffs. 


DETECTOR 
COLD SHIELD 


ONE DETECTOR 
OF A MULTI- 
ELEMENT ARRA 


TELESCOPE SCANNING DETECTOR 
MECHANISM LENS 


Figure 1.7 Basic features of the example system. 
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CHAPTER TWO — THERMAL RADIATION 
THEORY AND ATMOSPHERIC TRANSMISSION 


2.1 Introduction 


Every object whose temperature is not absolute zero emits what is 
loosely called “thermal radiation". We will be concerned here only with that 
thermal radiation emitted in the 3 to 14 um wavelength region for two 
reasons. First, most of the energy emitted by an object at terrestrial temper- 
atures is emitted within this spectral wavelength band in an energy exchange 
between the electromagnetic field and the thermally-excited vibration- 
rotation electron energy level states of the body. Second, there are trans- 
missive atmospheric windows in this band which permit signal detection over 
comparatively long ranges. 

Thermal radiation at terrestrial temperatures consists primarily of self- 
emitted radiation from vibrational and rotational quantum energy level 
transitions in molecules, and secondarily from reflection of radiation from 
other heated sources. In many imaging applications, the actual mechanism is 
unimportant, and the only important factor is the existence of apparent tem- 
perature differences. The fundamental element in thermal radiation is the 
Planck blackbody radiation theorem. The basic premise of Planck's theorem 
is that thermal radiation is generated by linear atomic oscillators in simple 
harmonic motion which emit not continuously but in discrete quanta whose 
energy E is a function of the radiation frequency v given by E= hv, where 
his Planck's constant. Descriptions of Planck's theory are presented by 
Bramson! , Jamieson, et al.? , and Merritt and Hall’. 

Thermal radiators are characterized by their radiation emission effici- 
encies using three categories: blackbodies, greybodies, and selective radiators. 
This radiation efficiency property is described by the quantity spectral emis- 
sivity, €(À), which may be considered a radiation emission efficiency at a given 
wavelength A. The blackbody is an idealization; it emits and absorbs the 
maximum theoretically available amount of thermal radiation at a given tem- 
perature. A blackbody has e = 1 within a given wavelength range of interest, 
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and а greybody hase = constant < 1 within a specified band. A selective 
radiator hasO<e(A) <1 and may exhibit virtually any kind of single- 
valued dependence on А. These quantities are also dependent upon the 
viewing angle, but in analysis we usually deal with simple narrow-angle geom- 
etries and assume an average emissivity for all angles. 


2.2 The Radiation Laws 


The bare essentials of radiation geometry are depicted in Figure 2.1, 
where S is a two-dimensional radiating surface, dS is a differential surface 
area, and R(x,y) is the radius from the origin to a point (x,y) on the surface S. 
The cone defined by the movement of R around the periphery of S is des- 
cribed by a solid angle {2 in steradians: 


- dS _ dx dy 
Q =| | —— =| | —— 2. 
J R? (x,y) J R? (x,y) (2-1) 


SURFACE S 


X 


Figure 2.1 Radiation geometry. 


Before summarizing the radiation laws, we have to consider notation. 
Several competitive notation systems are in use, including the very general 
phluometric system^. However, the quantities recommended by Hudson? 
suffice and are widely used in thermal imaging. The following basic notation 
is used: surface area S (сіп? |, solid angle Q [sr], time t [s], and the partial 
differentials with respect to these quantities: 0/0S, 0/90, and 0/ot. 
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Letting U represent radiant energy, we define the following quantities: 


radiant power = P[watts] 4 dU/ot 


| | Е 244 _ 220 
radiant emittance = У [ма ст“ | = dP/dS = 35 дї 


| | ME A _ 07U 
radiant intensity = J[watt/sr] = dP/d& = aS ot 


ME 25123 - 0° О _ 
radiance = N[watt/cm^ sr] = 0°P/(0S 0&2) = 95 00. 21 _ 


The spectral distribution of each of these quantities is found by taking the 
partial derivative with respect to wavelength, and is denoted by a subscripted 
ХА as in spectral radiant emittance Му. 

One radiation law frequently assumed to be valid for analytical purposes 
is Lambert’s cosine law. This states that the radiant intensity J[watt/sr] from 
an ideal perfectly diffuse source is proportional to the cosine of the angle 
between the normal to the surface and the viewing angle. This is a good 
approximation for many materials at a near-normal viewing angle. For a flat 
surface radiating W[watt/cm? 1, integration over 27 steradians of W weighted 
by Lambert’s law yields 


М 2W[n. (2.2) 


Planck's blackbody radiation law (where the symbols have the values 
given in Table 2.1) is 


(От с? ] watt 
UA an-( x ) т зат) ES | (2.3) 


Equation 2.3 is more conveniently expressed in units of [ма ст um] 
for À in [um] as: 


№ OD LATUS (2.4) 


where c, and с, have the values given in Table 2.1. Planck’s law is the partial 
derivative with respect to wavelength of the total blackbody radiation W(T) 
in[watt/cm?]. This law must be expressed as a derivative with respect to À 
rather than as a simple function of À because it describes a distribution. That 
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is, at a specific wavelength W(A) = 0 whereas W # 0. Integration of Planck's 
law yields the Stefan-Boltzmann law, originally empirical, 


а 212 k* 
W(T) = WA (T,A) dA = TD T^. (2.5) 
J 15с2Һ3 
Table 2.1 


Radiometric Constants 


c (speed of light in a vacuum) = 2.9979 x 1019 [cm/sec] 

h (Planck's constant) = 6.6256 х 10?^ [watt sec? | 

к (Boltzmann's constant) = 1.38054 x 107? [watt sec/^K] 

о (the Stefan-Boltzmann constant) = 5.6697 x 107 2 [watts/cm? ^K^] 
c, = 3.7415 x 10* [watts шт“ /cm? ] 

с; = 1.4388 x 10^ [um °K] 

сз = 1.8837 x 102° [um? /sec ст? 1 


The constant term is called the Stefan-Boltzmann constant o, 


2т° k^ 
0 = >" e 2.6 
1 5c? h? (2-6) 


For some analytical purposes it is useful to have radiation quantities 
expressed in units of [photons/second]. Any quantity distributed with wave- 
length and expressed in units of [watts] may be expressed in units of 
[photons/second] by dividing by the energy of one photon, ћс/Х. For 
example, Planck's law converted to spectral radiant photon emittance 
Q4 [photons/sec cm? ] becomes 


_ (тс 1 
Qa = =) eser) en 


The expression for Q} іп [photons/sec cm? um] for Х in [um] is 


C3 


QA = M (e€2/AT 1) 


(2.8) 


with the constants as given in Table 2.1. 
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The spectral radiant emittance W) is plotted as a function of wavelength 
at several interesting temperatures in Figure 2.2. The spectral radiant photon 
emittance Оу is plotted for T = 300°K in Figure 2.3. 

The observed radiance from a solid object is the sum of the radiation 
transmitted and emitted by the body itself, and the radiation reflected by 
the body from other thermal sources. As the self-emitted radiation is a func- 
tion of the object’s temperature, it is useful to inquire how W4 changes with 
differential changes in T. The reader may easily verify that 


OW) C, Cy eC2/AT C2 ес2/ХТ W C2 Q 9) 
Хут? | 


= = WO = 
oT 6T? (ес2 /AT ү) À AT? (e€2 /AT—1) 


when e€2/AT >], The exact expression for ду, /ӘТ is plotted for terrestrial 
temperatures in Figure 2.4. 

Some useful defined quantities are: background radiant emittance 
(Wp), target radiant emittance (Мт), differential change іп W with differ- 
ential change in T (0W/0T), and radiation contrast (Ср). These are defined 
for target and background absolute temperatures Тт and Tp, and for specific 
spectral bands, by 


№ 
Ув = | Wy (Тв) ах (2.10) 
М 
Аҙ 
мт = | Wy (Tr) ах (2.11) 
i, 
Y 
aw _ f 9 WA (Tp) (2.12 
oT > oT 12) 
1 
Wr -Wg * 
Ср= =. (2.13) 
WT+Wp 


*If the definition of contrast as C = (Мт — Wp)/Wp is preferred, CR = C/2 for Wy = Wp. The term 
contrast has been applied to many different photometric and radiometric quantities. For a target 
level Ly and background Lp, contrast has been variously defined as: (Lr — Lpg)/(Lq + Lp), (Ly — Lp)/ 
Ly, (Lr — Lp)/Lp, and LT/Lg. As different authors use different definitions, the appropriate one 
will be noted when the term contrast is associated with specific numbers. 
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Figure 2.2 Planck's law Тог spectral radiant emittance at 
three background temperatures Тр. 
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Figure 2.3 Planck's law for spectral photon emittance at a 
background temperature of 300^ K. 
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Figure 2.4 The thermal derivative of Planck's law evaluated 
at three background temperatures. 
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To keep the notation simple, the spectral band is not explicitly noted in 
these quantities and is implied by the context. The majority of terrestrial 
scenes do not often have apparent temperature differences in excess of 
+20°K from ambient, so we will confine our attention to that range. 

The quantities Ут, Wp, W/T, and Ср calculated for an ambient tem- 
perature of 300°K and a target AT of 10°K for two commonly used atmos- 
pheric windows are given below: 


OW / watts 
Spectral Band — Wp(watts/cm?) Wr(watts/cm ) OT \ст?°К/ Cp for AT = 10°C 
3.5 — 5 um 5.56 X 104 7.87 X 10-4 2X108 0.172 
8 — 14 um 1.72 X 102 1.99 X 102 2.62 X 104 0.074 


Table 2.2 gives the dW/0T’s for other spectral bands and background temper- 
atures, and Figures 2.5 and 2.6 plot the most often used results. Figure 2.7 
compares the 3.5 to 5 um and the 8 to 14 um spectral bands. 
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Figure 2.5 Values of the integrated thermal derivative of Planck’s law 
for the 8 to 14 um band as a function of 
background temperature. 
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Figure 2.6 Values of the integrated thermal derivative of Planck’s law for the 
3.5 to 5 um band as a function of background temperature. 
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Figure 2.7 Comparison of the integral of the thermal derivative of Planck's law 
for the two spectral bands as a function of background temperature. 
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Table 2.2 
Selected Values of the Spectral Integral of the Thermal Derivative 
of Planck’s law. 


OT X oT cm? ?K 
Мит) Хит) Tp =280°K Tp = 290°K Тв = 300°K Тр = 310°К 
3 5 1.1 X 10° 1.54 X 105 21 X 109 2.81 X 109 
3 5.5 2.01 X 10° 2.73 X 105 3.62 X 10° 4.72 X 109 
3.5 5 1.06 X 105 1.47 X 109 2X 105 2.65 X 105 
3.5 5.5 1.97 X 105 2.66 X 105 3.52 X 105 4.57 X 105 
4 5 9.18 X 10°6 1.26 X 105 1.60X 105 | 223x 105 
4 5.5 1.83 X 10° 2.45 X 105 3.22 X 105 4.14 X 105 
8 10 8.47 X 105 9.65 X 109 1.09 X 104 1.21 X 104 
8 12 1.58 X 104 1.77 X 104 1.97 X 104 2.17 X 104 
8 14 2.15 X 104 2.38 X 104 2.62 X 104 2.86 X 104 
10 12 7.34 X 105 8.08 X 105 8.81 X 105 9.55 X 105 
10 14 1.3 X 104 1.42 X 104 1.53 X 104 1.65 X 105 
12 14 5.67 X 1079 6.10 X 109 6.52 X 105 6.92 X 10° 


The radiation contrast Cg varies with changes in target and background 
temperatures for the two spectral bands as shown in Figures 2.8 and 2.9. A 
useful number to remember is the differential change in radiation contrast 
with a differential temperature difference, ОСр /9Т. From the above calcula- 
tions, we see that ӘСр /дТ is approximately 0.7%/°K for the 8 to 14-шп band 
and 1.7%/°K for the 3 to 5-шт band. This clearly shows that any system 
which linearly converts thermal energy to visible energy will not exhibit the 
high contrasts necessary for acceptable image quality. Thus it is necessary to 
have some sort of background radiance subtraction scheme in thermal 
imaging. 


Thermal Radiation Theory and Atmospheric Transmission 


3 
Тр = 240°К 
2 Tp = 260°K 
са = ТВ Tg = 280°K 
Wr+We 4 Я Тр = 300°К 


0 5 10 15 20 
T—Tp [°K] 


Figure 2.8 Radiation contrast for the 8 to 14 um band as a function 
of target-to-background temperature difference for 
four background temperatures. 
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Figure 2.9 Radiation contrast for the 3.5 to 5 um band as a function 
of target-to-background temperature difference for 
four background temperatures. 
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The radiation contrast is most simply evaluated for the case of black- 
body target emission over all wavelengths. In that case, the total radiant 
emittance W(T) is given by equation 2.5, 


W(T) = от“, 
SO 


oW(T) 


- 3 
3T 4 0717, 


and 
Cy = 4 oT? AT 
Rg = — == __ 
2 014 * 401? AT 


__2AT 
T+2 AT 


| 


—T— (2.15) 


for small AT. 

Then Ср /АТ = 2/T, and for terrestrial temperatures poorer contrast 
is achieved broadband than is possible with the two narrow spectral bands 
we have considered. These are the basic concepts of thermal radiation theory 
which we will need here. Expanded treatments are given in any of references 
] through 9. 


2.3 Atmospheric Transmission 


Thermal radiation is attenuated in passage through terrestrial atmos- 
pheres by the processes of absorption and of scattering by gas molecules, 
molecular clusters (aerosols), rain, snow, and suspensions such as smoke, fog, 
haze, and smog. The following molecules (in order of importance) absorb 
radiation in the thermal infrared in broad bands centered on the wave- 
lengths indicated: water (2.7, 3.2, 6.3 um), carbon dioxide (2.7, 4.3, 15 um), 
ozone (4.8,9.6, 14.2 um), nitrous oxide (4.7, 7.8 um), carbon monoxide 
(4.8 um), and methane (3.2, 7.8 um). Molecular absorption is the greatest 
source of extinction except in dense suspensions, and water vapor, carbon 
dioxide, and ozone are the most significant absorbers. In the lower atmos- 
phere, absorption by nitrous oxide and carbon monoxide may usually be 
neglected. The 6.3 um water band and the 2.7 and 15 um carbon dioxide 
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bands effectively limit atmospheric transmission in the 2 to 20 um thermal 
spectral range to two atmospheric windows. These are the 3.5 to 5 um and 
the 8 to 14 um bands. 

At a specific wavelength Л for a specific atmospheric state, the atmos- 
pheric transmission 7 д (№) is given by the Lambert-Beer law, 


TACA) -ехр(-Ү(ОК), (2.16) 


where R is the range or path length and in the notation of McClatchey, 
et al.! 9, у) is the extinction coefficient. The extinction coefficient is the 
sum of an absorption coefficient о(А) and a scattering coefficient k(A), 


YA) = oA) + KA) (2.17) 


The scattering and absorption coefficients are in turn sums of molecular 
and aerosol components 


O(A) = oq, (X) + о, (А) (2.18) 
and 
kA) = kp A) + k A). (2.19) 


The extinction coefficient is a complicated function of ^, so over a 
broad spectral band the atmospheric transmission prediction problem con- 
sists of an integration over wavelength and over an accumulation of absorbers. 
The average transmission for a specific band A, to A, is 


№ 
Ta y. exp CONRAN (2.20) 
А 


1 


А broad survey of atmospheric transmission definitions and problems is 
given by Farrow and Gibson! ! . Reliable measurements of atmospheric trans- 
mission in the infrared are rare. The best are still those well known measure- 
ments of Taylor and Yates! 2, reproduced in Figures 2.10, 2.11, and 2.12, 
and the measurements of Streete! shown in Figures 2.13 and 2.14. The 
detailed mechanisms of atmospheric effects have been presented elsewhere? >7, 
so we will not consider them here; rather, we will identify and use a reliable 
analytical model. 


32 Chapter 2 


1000 FT. PATH, 0.11 PRECIPITABLE CENTIMETERS 
3.4 STATUTE MILE PATH, 1.37 pr. cm 
10.1 m PATH, 5.2 pr. cm 


TRANSMISSION (PERCENT) 


2.8 2.9 3.0 3.1 3.2 3.3 3.4 3.5 3.6 3.7 3.8 3.9 4.0 4.1 4.2 
WAVELENGTH [um] 


Figure 2.10 Taylor and Yates 2.8 to 4.2 џт data adapted from reference 12. 
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Figure 2.11 Taylor and Yates 4.3 to 5.6 um data adapted from reference 12. 
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Figure 2.12 Taylor and Yates 6.5 to 14.0 um data adapted from reference 12. 
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Figure 2.13 The 3 to 5 um data of Streete adapted from reference 13. 
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Gaseous molecular absorption occurs when vibrational and rotational 
atomic motions in a molecule are allowed such that a net dipole moment 
change is produced. Then energy exchange with the electromagnetic field 
can result by resonant absorption and emission of quanta. Consequently, 
individual symmetric molecules do not exhibit vibration/rotation absorption 
because they cannot generate a net dipole moment. Also, molecular absorp- 
tion by the electronic bands of atoms is insignificant. Thus for these two 
reasons individual molecules of the three main atmospheric species, diatomic 
nitrogen and oxygen, and monatomic argon do not absorb infrared. In a gas, 
however, additional states are introduced by the effect called pressure 
broadening, and appreciable absorption occurs. 

Molecular absorption causes extinction of a beam of thermal radiation 
because the gas reradiates the absorbed energy in all directions rather than 
in the original direction of propagation. A good explication of molecular 
absorption is given by Anding!*, who presents the Elsasser, random, and 
random-Elsasser models of molecular absorption. Anding recommends ap- 
propriate models for the various molecular species, and compares predictions 
using these models with empirical data. McClatchey, et al.!?, is a good 
source for the analysis of the effects of molecular scattering and of aerosol 
extinction. The regular texts on infrared technology contain summaries of 
the various single line and band absorption models. 

At high relative humidities, water vapor molecules tend to cluster and 
to produce an aerosol, a condition intermediate between a humid atmosphere 
and a fog or haze. Using reliable molecular absorption and scattering models, 
and a reliable aerosol scattering model, Hodges!” compared predictions to 
empirical results and found that liquid aerosol absorption is quite significant 
at high relative humidity. For low altitudes and moderate ranges at which 
water vapor concentrations are low, Hodges asserts that molecular absorption 
alone reasonably accounts for extinction. 

Scattering by molecules, aerosols, fogs, hazes, and clouds is describable 
by the mechanism known as Mie scattering. The Mie theory is valid for small 
particle, or Rayleigh, scattering where the particle is much smaller than a 
wavelength, and for large particle, or nonselective, scattering where the 
particle size p is much greater than a wavelength. Since Rayleigh scattering 
exhibits a A4 dependence, molecular scattering is insignificant beyond about 
2 um. Тһе result is that scattering by molecules (р << №) is insignificant 
compared to absorption, and only molecular aggregates produce scattering. 
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The best single source of models for all sources of atmospheric extinc- 
tion is the report of McClatchey, et al.!°. They detail methods for predicting 
the effects in the 0.25 to 25 um region of the following mechanisms: 


e Discrete line molecular absorption for all significant gases; 

€  Pressure-broadened line molecular absorption by water vapor; 
e Molecular scattering for all species; 

• Aerosol extinction for all species. 


The paper also gives tables of characteristics for seven model atmospheres: 
tropical, mid-latitude summer, mid-latitude winter, subarctic summer, sub- 
arctic winter, clear (23 km visibility), and hazy (5 km visibility). 

Figures 2.15 through 2.19 are taken from McClatchey, et.al. These 
figures show transmittance as a function of wavelength for the appropriate 
absorber density or for path length, for molecular absorption by non- 
pressure-broadened water vapor, for the pressure-broadened water vapor 
continum, for the uniformly mixed gases, for molecular scattering, and for 
aerosol extinction. The curves may be used directly for a path through a 
uniform atmosphere. Otherwise an equivalent path must be calculated per 
the original paper. The paper contains many other graphs for individual 
species. These curves are applied by tracing the appropriate transmittance 
scale onto a transparent sheet. The traced scale is then used by placing the 
desired absorber amount (or path length) on the horizontal scale line with 
the vertical scale over the wavelength of interest. The transmittance is read 
off where the curve crosses the scale. 

The absorber concentration of water vapor may be described in several 
ways. The most fundamental measure is the absolute humidity, the water 
mass in grams of a one-cubic centimeter volume of air containing water 
vapor at some specified temperature and pressure. The absolute humidity 
H, [gm/cm? | isa function of water Vapor partial pressure P[mm of mercury] 
and atmospheric temperature TA [°К]. The absolute humidity H,’ of a mass 
of air saturated with water vapor is given by 

P 


Н; = 2.89 X 104 Ta (2.21) 


where the partial pressure P also depends on the temperature T,. A more 
familiar measure of absorber concentration is the relative humidity. 
Relative humidity is the ratio of absolute humidity under a specified set of 
conditions to the absolute humidity when the volume is saturated under the 
same conditions. 
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Figure 2.19 Transmission by aerosols, adapted from reference 10. 
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Absolute humidity and relative humidity do not directly indicate the 
total amount of absorber in a given path length. The precipitable water is the 
quantity which combines path length and relative humidity in a meaningful 
measure of total path absorption. Precipitable water W is defined in terms of 
depth of the column of liquid which would result from the condensation of 
all the vapor in a cylinder of air having a particular path length, and is 
usually expressed in units of precipitable centimeters of water per kilometer 
of path length [pr cm/Km]. 

Precipitable water is expressed in terms of liquid water density d, 
relative humidity H, (expressed as a decimal fraction), and absolute humid- 
ity H, as follows: 


H, H, ШШЕ 105 H, H, Б | 
Pom cm? _ 


ст“ Km 
W 
 d[gm/cm?] - d [gm/cm? ] 
(2.22) 
- - роз На r рен | 
Since d is approximately constant at 1 gm/cm?, 
W = 105 H, H, pom. (2.23) 
Km 


The total depth of precipitable water in a path is expressed in units of pre- 
cipitable centimeters. For a path having uniform relative and absolute 
humidities, the total precipitable water is found by multiplying W by the 
path length R in kilometers. The precipitable water and the absolute humidity 
at saturation are shown in Figure 2.20. 

Moser!" has determined that at saturation and at sea level, the precip- 
itable water W' is given by 


Ww’ ped = 0.492 + 3.094 X 10? Т, 
Km 
(2.24) 
+0.95 X 10? T4? + 2.888 X 105 Ty? 
to an accuracy of +1.6% for OC & ТА <40°C and ОС Hg <1, and by 


W' рот) = 0.502 e 0.06 ТА (2.25) 


to an accuracy of +3.6% for O & TA < 35°C. 


Thermal Radiation Theory and Atmospheric Transmission 


33 


Сә 
Re ee Е 
N N NO NR о со 
о ч о о © © © 


2.5 


~ 
= > - N N N м м 
~ с о о > мо Ff сл 


1.5 


(100% RELATIVE HUMIDITY) 
o o 


W 
PRECIPITABLE WATER 
AND AT SEA LEVEL IN 


AT SATURATION 

H 
ABSOLUTE HUMIDITY 
AT SATURATION AND 


UNITS OF [pr. cm /km] 


————— а — .-а 
о > N Ù ~ сло м с о OC —э ку о A 


pe | 
-40 -30 -20 -10 0 10 20 
AIR TEMPERATURE °C 


CO 
© 


Figure 2.20 Precipitable centimeters of water vapor per kilometer 
of path length and absolute humidity versus temperature at saturation. 
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Multiplication of the above expressions by the relative humidity H, yields 
the precipitable water W at other than saturated conditions. 

Hodges!5 calculated the average transmission in seven nominally 
“3-5 um" and “8-14 um" bands using the empirical Yates and Taylor data, 
found the contributions of each extinction process, and derived coefficients 
which permit predictions to agree reasonably well with Yates and Taylor. 
The plot in Figure 2.21 of Hodges’ reduction of Yates and Taylor data in 
8 to 14 um bands shows an approximately linear relation between average 
band transmission TA and total precipitable water vapor Wr [pr ст] in the 
path. This interesting result is fitted! é well by: 


тд 7.8326 — .0277 Wp. (2.26) 


The residual absorption loss of .167 at Wy = 0 presumably is due to extinc- 
tion by other species. Plots of the 3 to 5 um data do not show any clear 
dependence on Wr, reflecting the fact that CO, absorption is more important 
in the 3-5 um band than in the 8-14 um band. 

The principles of the paper of McClatchey, et al.! ° are incorporated in 
a computer program written by Selby and McClatchey!®. This program was 
used by MacDonald!? to generate the curves shown in Figures 2.22 through 
2.42. To provide a comparison of transmission loss at 3 to 5 um and at 8 to 
12 um, MacDonald selected three slant ranges: 4, 8, and 16 nmi, and three 
altitudes: 300 feet, 4000 feet, and 12,000 feet. Two model atmospheres 
were used: the tropical and the 1962 U.S. standard, both with and without 
haze. Transmission variations with visibility at various altitudes and ranges 
are shown for the 8 to 12 um band in Figures 2.24 through 2.27, and for the 
3 to 5 um band in Figures 2.28 through 2.33. The differences between 
tropical and U.S. standard atmospheres for various altitudes and slant ranges 
for the 8 to 12 um band are shown in Figures 2.34 through 2.42. 

Excluding source and detector characteristics, the results indicate that 
in mid-latitude dry air masses, transmission in the 8 to 12 um band is 
superior because CO, heavily absorbs 3 to 5 um radiation. However, in 
clear high humidity tropical air masses, the 3 to 5 um range is clearly superior 
due to heavy absorption of 8 to 12 um radiation by water vapor. The fact 
that haze attenuates the 3 to 5 more than the 8 to 12, however, tends to 
make the 8 to 12 um range the preferred choice because long haze-free 
slant paths through humid atmospheres are not frequently encountered. The 
primary quantity of interest, however, is not absolute transmission but the 
ratio of thermal sensitivity to transmission. In this respect, the 8 to 12 um 
region is usually superior to the 3 to 5 um region because of an inherent 
sensitivity difference between the two regions even when no transmission 
losses are present, as discussed in Section 5.12. 
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Figure 2.21 Average bandpass transmission versus total precipitable 
water for five bands, adapted from reference 15. 
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A comparison of atmospheric transmission over long sea paths in the 
two spectral bands is given by Jaeger, et al??. Using good InSb and 
(Hg,Cd)Te detectors in a radiometer, they showed that over 1 to 13 Km 
paths in the background-variation-limited noise (sea surface changes), ex- 
perimental predictions of 8-14 um signal transmission superiority are correct. 
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Figure 2.22 Atmospheric transmission. 
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Figure 2.23 Atmospheric transmission. 
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Figure 2.24 Computed atmospheric transmission. 
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Figure 2.25 Atmospheric transmission. 
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Figure 2.26 Atmospheric transmission. 
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Figure 2.27 Atmospheric transmission. 
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Figure 2.28 Atmospheric transmission. 
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Figure 2.29 Atmospheric transmission. 
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Figure 2.30 Atmospheric transmission. 


Chapter 2 


Thermal Radiation Theory and Atmospheric Transmission 


TRANSMISSION 


SLANT RANGE — 8 NAUTICAL MILES 
TROPICAL ATMOSPHERE 


NO HAZE, NO AEROSOLS 
------- HAZE WITH 5 km VISUAL RANGE 


12000 FT. ALTITUDE 
4000 FT ALT. 


1.0 300 FT ALT. 


0.1 


01) 


001 


Figure 2.31 Atmospheric transmission. 
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Figure 2.32 Atmospheric transmission. 
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Figure 2.33 Atmospheric transmission. 
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Figure 2.34 Atmospheric transmission. 
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Figure 2.35 Atmospheric transmission. 
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Figure 2.36 Atmospheric transmission. 
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Figure 2.37 Atmospheric transmission. 
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Figure 2.38 Atmospheric transmission. 
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Figure 2.39 Atmospheric transmission. 
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Figure 2.40 Atmospheric transmission. 
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Figure 2.41 Atmospheric transmission. 
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CHAPTER 3 — LINEAR FILTER THEORY 


3.1 Applicability 


There is frequent need in thermal imaging practice to describe the 
responses of a system to various signals, to predict spatial resolution, and to 
devise image enhancement schemes. A tool which is useful for this purpose is 
linear filter theory!;? , a branch of Fourier transform analysis used for analy- 
zing the signal response of the class of linear, invariant, stable systems. Linear 
filter theory is applicable to optical, electro-optical, mechanical, and electronic 
devices. It originated in the description of electrical networks and servo- 
mechanisms and was extended to optical systems. Linear filter theory forms 
an indispensible part of image analysis and is necessary to every thermal 
imaging system design evaluation. 

Goodman! defines a system as a mapping of a set of input functions into 
a set of output functions. We will consider here only single-valued system 
functions or one-to-one mappings, so that noisy systems are excluded. 
Figure 3.1 indicates the coordinate scheme and nomenclature we will use. 
A point in object space is located by referring to a set of Cartesian angular 
coordinates (x,,y,) measured іп the x and y directions between the optical 
pointing axis and the line of sight from the aperture to the point. The image 
plane angular coordinates (x;,y;) are measured from the eye, and for a dis- 
tortionless system they are given by х; = Mx, and y; = Мур, where M is the 
angular magnification. 

Linear filter theory is applicable only to optical systems which map 
object distributions into image distributions by the process of convolution. 
Most imaging systems use convolutionary processes to the maximum extent 
possible because such processes generally produce the best imagery. Con- 
sequently it is important to understand convolution and to be able to recognize 
a non-convolutionary process. 


3.2 The Convolution Integral 


Consider a two-dimensional object whose radiant emittance is described 
by an object distribution O(x,,y,) given as a function of object space angular 
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Cartesian coordinates (хо,Уо). Let the operation of the imaging system on апу 
object be denoted by the operator S | | Then the image distribution will 
be S1O(x,, y) . 

A system is defined to be linear if for arbitrary complex scalar weightings 
c and d of arbitrary object functions О, (х,у) and О, (х,у), the following 
equation is satisfied: 


S со, (хо Yo) + dO, (Xo Yo)} = cS lo, (Xo Yo) + dS lo; (Xo Yo} ° (3. 1) 


A consequence of system linearity is that any object function can be repre- 
sented as a series, or in the limit as an integral, of simpler functions. To 
assure that this important simplification can be made, attention must be con- 
fined to systems which are, or can be approximated to be, linear operators in 
time and space on the signals of interest. 

In general, object functions are four-dimensional functions of space and 
time, but for analytical purposes we usually consider that depth is not signifi- 
cant and therefore describe objects as functions of two angular spatial coordi- 
nates and one time coordinate. It is convenient to decompose an object 
function into a weighted integral of Dirac delta functions using the “‘sifting 
property” of the Dirac delta 6(х). The mathematical expression of the sifting 
property in two dimensions is: 


оо 


O(% Vo) = ffot.m 6 (к -E 8 (Yo - m dan, (3.2) 


-оо 


апа its geometrical interpretation is shown in Figure 3.2. The meaning of this 
integral is that an arbitrary function is representable as an infinite sum of 
weighted and displaced delta functions. 
The image distribution l(xj,y;) is 
(ұу) = 5 o6.) 
(3.3) 


=s | ff Om ê 65 9 Yon) ағат. 
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Figure 3.2 The sifting property of the delta function. 
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Assuming linearity, this becomes 


оо 


хуй = ff Оет 580-0 8 oum] agan: (3.4) 


- ОО 


Thus the image function is a weighted sum ОҒ the system responses to the 
component delta functions of the object. 

The above equation is called a superposition integral, and 
S 15(x,-£) 6 (yo-)] is called the point spread function orimpulse response. 
Equation 3.4 is called the superposition integral because it expresses a general 
function I(x;,y;) as an integral of superimposed responses to infinitesimally 
spaced three-dimensional delta functions. In electromagnetic field theory, 
the quantity analogous to the point spread function is Green's function for a 
point charge or current source, and in electric circuit theory, it is the response 
to a current or voltage impulse. 

We generalize to the three-dimensional case of interest by the expression 


I(x,y;.t) - slo«, Yo ә) 
| (3.5) 


J J J облы? S [50-0 è Grm 5 t) tima 


where it is assumed that any time delay between input and output is 
negligible. 

The superposition integral is further simplified by requiring that the 
system be spatially invariant, that is by requiring independence of the 
impulse response of the system on the time and position of the impulse.* If 
we assume unity magnification M for simplicity in all that follows, then the 
system response to an impulse becomes 


~ 


sfat 6 (Y o-n) 6 (e+) = 515064) б (yy) 6 (t| 
(3.6) 


A 
- r(x, ,y;,t Бе ). 


*Actually, framing thermal imaging systems аге not time-invariant, because a given object point 
produces a response only a small fraction of the time. However, we are interested in the effect of the 
system when. viewed by a time-integrating device such as the human visual system, so that when the 
object is static and there is no motion between the object and the system, we may consider the 
system to be time-invariant. 
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The following graphical argument demonstrates that the superposition 
integral simplifies to a form called the convolution integral when the system 
is spatially invariant so that the form of the impulse response r is constant 
with coordinate translations. Consider the object distribution and spread 
function shown in Figure 3.3. The image distribution is found by decomposing 
the object into weighted delta functions, replacing each of these deltas with 
weighted spread functions, and summing, as represented in Figure 3.4. 


O(x) r(x) 


(a) (b) 


Figure 3.3 (a) An object distribution and 
(b) a spread function. 


Thus the value of the image distribution at a point x; is found by 
summing the contributions at x; of impulse responses centered at x; - Е and 
weighted by the values of O at all points £. Mathematically this is given by 


(x)- | офто: at, 3.7) 


which is a convolution integral and is symbolized by 


I(x) = O(x) s r(x). (3.8) 
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Figure 3.4 Construction of the image function by decomposition 
of the object function (a), replacement with weighted spread 
functions (b), and summation (c). 


An alternative way to look at this which may not be intuitively obvious 
is to recognize that for a particular x;, the variable in the above equation is £, 
and that r(x;-£) = r[-(&-x;)] is the reflection of r(£) centered on x;. Thus this 
integral may be interpreted as shown in Figure 3.5 as the integral over the 
product of the object and the reflected and displaced spread function. Thus 
it is clear that 


ТО ,У 5; Ent) — I(X, — Ё, Уі — 1, t— t’), (3.9) 


both from the above argument and because r must be invariant with coordi- 
nate translations. Then for the three-dimensional case, the convolution inte- 
gral is 


(х,у) = | | | O(E,n,t') r(x-Eyjn.tt) адтае 


(3.10) 
= Ox, y, t) * Хур). 
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Figure 3.5 Convolution. 


3.3 The Fourier Transform 


The Fourier transform representation we will use here is that the for- 
ward transform is defined in one dimension by 


Flo oo} = || О (хје 2781 ах А О (f). (3.11) 


-00 


The quantity fy is a Cartesian spatial variable called spatial frequency, 
and is analogous to the familiar temporal or electrical frequency ft. The units 
of ft are Hertz, and spatial frequencies have units of cycles per milliradian. 
The meaning of spatial frequency should be clear from Figure 3.6 which re- 
presents a one-dimensional sinusoidally varying light source having a spatial 
period 0,. The source is viewed at a distance R such that the small angle 
approximation 


2 sin (x) >) (3.12) 


can be made. Figure 3.7 shows how object and image spatial frequencies and 
angular magnification are related for the case of an image viewed on a screen. 
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Figure 3.6 Spatial frequency. 
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Figure 3.7 Angular magnification. 


The periods of the object and image patterns are 2ag/Rg and 2а;/К; 
respectively. The corresponding spatial frequencies are Ro/2ag, and К;/2а;, 
and the angular magnification is 


M = Eito (3.13) 
Roa; ` . 
In three dimensions, 
H - +tf 
F fo oy. = || || | О (xy, ђе Mx TY fy FED ay ду dt 
- оо 
(3.14) 
A “чы 
= O (fx, fy, Г). 
The inverse transform* is 
Р |бво | = jf Oo an = 00) (3.15) 
-оО 


*Normalization factors of 1/27 will be ignored in subsequent sections for simplicity. 
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in one dimension, and 


| ~ 
Е-! (ШАЙ) = Om? ба comet yt ar ar ar, 
| (3.16) 


А 
= O(x,y,t) 


in three dimensions. 
Successive forward and inverse transformations yield the original func- 
tion, that is 


Ез {Е{О(х)} } = O(n. 


A detailed discussion of the Fourier transform and its applications is given by 
Bracewell’. 


3.4 The Convolution Theorem and the Optical Transfer Function 


When a system is defined by a series of successive convolutions involving 
a complicated object and different spread functions, the process is difficult to 
visualize and to handle analytically. The convolution theorem of Fourier 
analysis allows this process to be transformed to the frequency domain where 
it is easier to manipulate. This theorem states that the Fourier transform of 
the convolution of two functions equals the product of the transforms of the 
two functions. For the image equation (3.10) this is expressed as 


F{Oxr} =F {O} - F{r}. (3.17) 


An abbreviated notation for this is 


~ 


T=O er. (3.18) 


The Fourier transform T of an impulse response r is called the optical 
transfer function (OTF). If each component impulse response of a system is 
linear and is independent of the other component impulse responses, the 
system impulse response r, is produced by a series of convolutions, 


ITI, Ij ж... ЖТ (3.19) 
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Then by the convolution theorem, the cascaded transfer functions can 
be multiplied together to produce the system OTF 


n 
T,;7 Il tm (3.20) 
m=1 


and the image spectrum is 


~~~ wd n ғ“ 
[=O П гу. (3.21) 
m= 


Thus the complicated operation of multiple convolutions can be replaced 
by the simpler process of finding the inverse transform of the product of the 
transforms of the convolved functions. This also allows us to think of the 
imaging process as a selective weighting in the frequency domain of the object 
spatial frequency spectrum by the system, as shown in Figure 3.8. Thus the 
OTF is a measure of an imaging system’s ability to recreate the spatial 
frequency content of a scene. 


^i 


O? 


Figure 3.8 Attenuation of object frequencies by the MTF, as a 
function of spatial frequency f. 
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The OTF is a complex quantity whose modulus or absolute value is a 
sine wave amplitude response function called the modulation transfer func- 
tion (MTF). The value of this function is normalized to unity at or near zero 
spatial frequency by convention. The argument in radians of the OTF is 
called the phase transfer function (PTF) and indicates the spatial frequency 
phase shift introduced by the system. The complex representation of the 
OTF is: 


OTF(f) = MTF(f) ei PTF(f). (3.22) 


Much of the OTF theory in the literature of optical and television 
systems is equally applicable to thermal systems if four conditions are satis- 
fied. These conditions are that: 1) the radiation detection is incoherent, 
2) the signal processing is linear, 3) the imaging is spatially invariant, and 
4) the system mapping is single valued (specifically, non-noisy). The last 
three conditions are regularly violated in thermal imagers. The infrared optical 
system will be spatially variant if its impulse response varies from the center 
to the edge of the field due to aberrations, and if scan position is nonlinear 
with time, causing the electrical filter to have different OTF's for the same 
frequency in different parts of the field. The detector array samples periodi- 
cally in the direction perpendicular to the scan, thereby producing non- 
convolutionary imagery, and each detector is electrically noisy, violating the 
one-to-one mapping requirement. The analog electronics may be both non- 
linear and noisy. Nonlinear signal compression may be used intentionally in 
the video processing to improve the system dynamic range, and the electro- 
optical signal conversion by the display may be nonlinear. 

In addition to these characteristics, thermal imaging systems may differ 
from other optical and electro-optical devices in four nontrivial ways. First 
and most important, a thermal imager usually subtracts the average scene 
value from the video, displaying only variations around the average value, 
and usually adds an arbitrary constant signal to the displayed image. Second, 
electronic signal shaping or special display techniques may be used to generate 
spread functions with negative amplitudes, producing MTF's with greater 
than unity amplitude and causing an OTF definition and normalization 
problem. Third, many different quasi-linear system operating points may be 
identified whose corresponding OTF's may not be identical. Fourth, the 
image may be formed by an insufficiently-filtered sampling process so that 
the OTF exists only in the direction normal to the direction in which the 
sampler operates. 
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Consequently when we speak of the OTF of a thermal imaging system, 
we very likely speak of a shaky construct. This construct is nevertheless 
irreplaceable as a design, analysis, and specification tool because it is one of 
the best measures of system image quality. Therefore, extreme care must be 
used in the analysis, specification, and testing of OTF to ensure that all of 
the above-mentioned deviations from the OTF existence conditions are 
avoided or accounted for. 

Since many thermal imaging systems do not satisfy the OTF-existence 
requirements in all directions, it is often necessary to resolve a point spread 
function in a direction where the conditions are satisfied. This results in the 
determination of a line spread function (LSF) which is the response to a long 
infinitesimally narrow line source composed of a string of delta functions. 
Such a geometrical line source contains only spatial frequencies in a direction 
perpendicular to the long dimension and contains all frequencies from zero to 
infinity with unity normalized amplitude as shown in Figure 3.9. The Fourier 
transform of the LSF is therefore unidirectional. 
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Figure 3.9 A line source and its spectrum. 


As the analog signal corresponding to this line passes through each com- 
ponent of the system, its frequency content is attenuated selectively. The 
image that appears at the output of the system consequently is not precisely 
reproduced and the effect observed is a spreading of the line image. An 
example LSF, MTF, and PTF for a thermal imaging system in the scan 
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direction are shown in Figure 3.10. Note that the asymmetrical LSF indicates 
the presence of phase shifts. In thermal imaging systems, the LSF and MTF 
are usually dominated by the detector angular subtense a, so the x axis may 
be normalized to а and the frequency axes normalized to 1/a. 
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Figure 3.10 Ап LSF, MTF, and PTF. 


The Fourier theory of optics was introduced independently by 
Duffieux^ іп 1946, Schade? іп 1948, апа Selwyn® in 1948. The subject is 
treated generally in such texts as Goodman’ , O'Neill? , Françon” and Linfoot?. 
Central problems in optics have been analyzed in references 9 through 17, and 
several excellent tutorial papers on the OTF of optical and electro- 
optical systems are provided by references 20 through 33. 

Fourier analysis in optics is similar to Laplace analysis in electrical cir- 
cuit theory, with a few exceptions. Time filters differ from spatial filters in 
two crucial ways, as indicated by O'Neill? . First, time filters are single-sided 
in time and must satisfy the casuality requirement that no change in the 
output of a time filter may occur before the application of an input, whereas 
optical filters are double-sided in space. As is shown in many texts on opera- 
tional mathematics or on electrical filter analysis, this difference requires the 
use of the single-sided Laplace transform for time filters rather than the 
Fourier transform. Second, electrical signals may be either positive or nega- 
tive, whereas optical intensities are always positive. 
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3.5 Convolutionary and Non-convolutionary Image Formation 


Since linear filter theory is such a useful image analysis tool, it is often 
worthwhile to approximate a non-convolutionary process as convolutionary 
wherever the deviations from ideal behavior are minor. For example, an 
imaging process which exhibits moderate spatial variance may be approx- 
imated as convolutionary over small object areas called isoplanatic patches. 
An image formed by samples of a convolution will sometimes be sufficiently 
filtered following the sampling process to be nearly convolutionary. 
Similarly, the effects of slight noise or slight nonlinearity may usually be 
safely ignored. 

One of the most important convolutionary processes in thermal imaging 
is the dissection of an image by a scanning detector. Consider Figure 3.11, 
where a square detector of dimensions “а” by “а” scans an image plane 
(x,y) on a scan line center y = y'. If the detector has a uniform responsivity 
over its surface and instantaneously integrates all incident radiation to pro- 
vide asingle electrical signal, the detector signal at a point (x’,y’) in the scan is 
given by I(x’,y’) and depends on the image plane energy distribution O(x,y) 
as follows, 


+ 
|| О(х,у) dx dy. (3.23) 


Now define the detector impulse response factor as r(x) given by: 


r(x) = 1, for -5<x< 
(3.24) 
= 0, otherwise. 


As the detector scans, x has a continuum of values x’, but у has only the 
value y’. Thus, we may rewrite the image equation as 


Y tSr æ% 
I(x',y)) = | І O(x.y) r(x!-x) dx | dy. (3.25) 
‚ a |-% 
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Figure 3.11 A scanning detector. 


The inner integral is a convolution, so the process is convolutionary 
in the scan direction. The outer integral indicates spatial variance, so the 
process is nonconvolutionary perpendicular to the scan direction. Broadcast 
television is a familiar example of this type of scanning image formation. 
Other examples of nonconvolutionary imaging abound. The eye is non- 
convolutionary in the strictest sense because finite detectors are used and 
because resolution degrades from the center to the edge of the visual field. 
Halftone photography is non-convolutionary because it is binary in intensity 
at a given point and is therefore both nonlinear and spatially variant. 
Ordinary photography is processed nonlinearly and exhibits an adjacency 
effect in development which makes it spatially variant. However, such pro- 
cesses may be treated as convolutionary if an appropriate magnification 
scale is chosen or if attention is confined to limited areas. 


3.6 Shorthand Fourier Analysis 


Before considering specific impulses responses and their transforms it is 
useful to make a brief digression into Fourier analysis notation. In many 
problems, explicit Fourier series and integrals are used to describe image 
formation in the frequency domain. Unfortunately, these equations are often 
difficult to visualize and to manipulate in complicated problems, as for 
example when many functions are convolved or multiplied together. 
Fourier analysis of such cases is considerably simplified by using the shorthand 
notation introduced by Bracewell? and Goodman!. This notation has two 
elements: mathematical operators and basic function transform pairs. 
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From the previous sections we have that the two-dimensional* Fourier 
transform operator is defined by: 


co 


Е {(х,у)} = ff g(x,y) exp [-27i(xf, + yfy)] dx dy 
n (3.26) 
= g (f, f). 


Convolution of two functions f(x,y) and g(x,y) is denoted by f«g where 


со 


f(x,y) * g(x,y) = | || f(x-€, y-n) (Ёп) d£ dn. (3.27) 


— со 


Convolution is also defined for frequency domain functions, and is associative, 
commutative, and distributive. Scalar multiplication is denoted by the 
symbol *, as in f*g. The operations of convolution and multiplication are 
transform pairs, thus 


F{f-g} = F[t] + Fl]. (3.28) 


The basic functions which are useful in thermal imaging are given in 
Figure 3.12 as mnemonic codes with their definitions, graphs, and transform 
pairs. To keep the notation simple, the sifting property of the Dirac delta 
function which is defined by 


f(a) = || f(x) 6 (x-a) dx (3.29) 


is here represented by 


f(a) = 6(a) f(x). (3.30) 


*Whenever an imaging process occurs in a temporal filter, we will consider stationary scenes only, and 
implicitly perform a coordinate transformation to the spatial domain so that only a two-dimensional 
transform is needed. 


Chapter 3 


86 


(€ 30 | 12945) 'иопејои јепоцзипј ривциоц6 2176 anbi 


7 А^ллу ¥O4S 


с е 4 _ 
т< 0- 


орым 


e 


(^4) 9 (бе) ours Esg i= 


Оз- Ax ‘Q= 
О = Ax ‘|= 


A 


х 11V НОУ 
0 + ^'0- 


(4) 9 0-А1- 


А 11v НОЗ 
0 5х '0= 


(Ay) $ А | 0-х1- 


МУОЗ5МУНЈ H3lunod NOIL V.LN3S3Hd3H WOlHdV¥9 NOI.LINI33Q 


(^)9(x)9 


(А) $ 


(Хх) 


NOILONAS 


87 


Linear Filter Theory 


(£ о 2 39999) “UOHeJOU jeuonoun; риеуја0ц5 21'5 anba 


с. d 'Q- 
(Чу 0) ouissag X | H m + X= Н 
С 0, (=) 211 
—- = = — 9 
| iu! " 
А 


(EEN (EEN А 11у НОУ 


Ww _ (5) ous 


(^) 9 (хе) 128H 


q e, 
L < таму Le 5 '0- 


A x In 4 ше е ‘y= q E 
(а) 2:15 (е) 3915 | = va аму l = ји | = (3) 1089 (5 198g 
INHOJSNVH.I H3lH 04d NOILVIN3S3Hd3H IVOlHdaVH9 NOI.LINIJ3G NOILON 


Chapter 3 


88 


( 10 £ 19995) "uonejou јеџоцзипј риецзлоцсѕ 21/6 anbi4 


^ ллу ноя 


(^) 9 (е) dug 


(Аја) ашод (Xje) ашод 


(Ay) 9 (е) ашод 


(9) мне 


(но) 2415 


NOI.LINI33Q NOILON(3 


NOILVIN3S3Hd3H 1VOIHaVHuo 


INHOJSNVH1 H3lunOd 


Linear Filter Theory 89 


As a simple example of how the notation is used, consider the 
stylized detector array shown in Figure 3.13. The individual detectors are 
described by Rect (х/о) • Rect (y/B). An infinitely long detector array is 
described by [Rect (x/a) • Rect (y/8)] + [Comb (y/y) * 6(x)]. A finite detec- 
tor array is limited by Rect (y/B). Combining these, Figure 3.13 is com- 
pletely described by 


| [Rect (x/o) * Rect (y/B)] * [Comb (y/y) * 6001) * Rect (y/B). 


We transform this expression by taking transforms of the outermost 
bracketed terms in succession, yielding: 


F | [Rect (x/o) * Rect (y/B)] + [Comb (y/y) * 6(x)] | * Rect (y/B)| 

= F{ [Rect (x/o) * Rect(y/B)] ж [Comb (y/y) * 600] | * F {Rect (y/B)} 

= [F {Rect (x/a) * Rect (y/6)} * F {Comb (y/y) 569] ] + F{Rect (y/B)}. 
Using the transform pairs given earlier, this reduces to: 


(Sinc (of,) * Sinc (Bf) * Comb (Yfy)] ж Sinc (Bfy). 


This demonstrates some commonly performed manipulations. 


3.7 Typical Component OTF's 


There are three spread functions which are useful approximations to 
the spread functions commonly found in imaging system components. The 
first is the two-dimensional rectangular function [Rect (x/a) Rect (y/8)] 
shown in x-profile in Figure 3.14. As a first approximation, rectangularly- 
shaped detector elements are assumed to have such a spatial impulse 
response. In fact, the detector responsivity contours may be anything but 
rectangular, but manufacturers may not be able to control this precisely or 
the precise contour may not be known, so one usually makes the rectangular 
assumptions. 

If the scan is in the x-direction only, the detector OTF is the transform 
of Rect (x/o), is real, and is given by the function 


Ty = sin (maf, )/(maf, ) = sinc (of, ). (3.31) 


Values of this function and its square are tabulated in Table 3.1 and graphed 
in Figure 3.15. The rectangular function may also describe rectangular dis- 
play elements such as electroluminescent diodes or glow modulator 
cavities. 
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Figure 3.13 А detector array іп angular coordinates. 
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Figure 3.14 A rectangular detector profile. 
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Table 3.1 


2 
sin (raf sin (naf 
Values of um can and 
of sin (raf) E e| of sin (та?) sin ux 
naf (naf) naf (rof) 
0 1 1 1 0 0 
.05 .996 .992 1.05 -.047 .002 
1 .984 .968 1.1 -.089 .008 
.15 .963 .928 1.15 -.126 .016 
2 .935 .875 1.2 -.156 024 
‚25 .900 .810 1.25 -.180 .032 
3 .858 137 1.3 -.198 .039 
.35 .810 .657 1.35 -.210 .044 
E .757 .573 1.4 -.216 .047 
45 „699 488 1.45 -.217 ‚047 
5 .637 .405 1.5 -.212 .045 
.55 .672 .327 1.55 -.202 .041 
‚6 505 .255 1.6 -.189 .036 
.65 .436 .190 1.65 - 172 .030 
y .368 .135 1.7 -.151 .023 
15 .300 .090 1.75 -.129 .017 
8 .234 .055 1.8 -.078 .006 
.85 .170 .029 1.9 -.052 .003 
.9 .109 .012 1.95 -.025 .001 


.95 .052 .003 2.0 0 0 
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Figure 3.15 Sinc function MTF. 
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The second useful spread function is the two-dimensional gaussian 
distribution, the simplest of which is the circularly-symmetric gaussian shown 
in x-profile in Figure 3.16. The spread function is 


2 2 
exp (CE, (3.32) 


1 
(х,у) = 
(њу) > 222 


and the associated OTF’s аге 


~ _ -200 fy? 


-2т° 02 6,2 
ту =e е y . 


and гу = 


The elliptically symmetrical gaussian is 


(у) = =—— exp (- X^ Y exp (- - (3.33) 
? 270 Oy =) 29.2] | 
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Figure 3.16 The generalized gaussian LSF. 


The x and y dependences are separable and the associated OTF's are 


2 2r 2 2 2r 2 
Th) -e^7 0х (х anaYt)- e?" бу ју. (3.34) 


A normalized gaussian MTF is shown in Figure 3-17. Three gaussians are 
tabulated in Table 3.2 where the sigmas are normalized to a detector subtense 
o and the frequencies are normalized to f, = 1/o. These are useful for quickly 
estimating the effects, relative to the detector MTF, of video monitors or 
other components having gaussian MTF's. 

Although impulse responses are seldom precisely gaussian in shape, it is 
often helpful to approximate them by gaussians. For example, an optical spot 
size may be given as the diameter 2p of a circle in which a certain percentage 
of the total spot energy is found. If the spot is suspected to be gaussian, we 
may approximate it by finding the gaussian which has the same percentage of 
energy P in a circle of radius p. 
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Figure 3.17 The generalized gaussian MTF. 
Thus for the gaussian form 


(у) = exp [-(х+у)? /20? ] = exp CR? /207) 


shown in Figure 3.18 we must find the o for which the following condition 
is satisfied: 


f^ f КЕ) R dR dọ 
_O Oo . 


=- (3.35) 
J J (R) R dR аф 
O О 


Integration over ф and then over R yields: 
P= 1 — exp Co? /[2R?), 
and solving for o yields 


p 


0 Ta . (3.36) 
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Thus one can easily translate a percentage of energy P within a given diameter 
for a gaussian spot into a standard deviation. For example, if 85 percent 
falls within a circle of radius p, o = 0.51p. 


Table 3.2 
MTF's of Components Having Gaussian Spread Functions. Sigma's are Normalized to 
Detector Angular Subtense a. 


MTF - exp E (Cm ) БІ 


МТЕ МТҒ МТҒ 
ғ for for for 
fo (o/a) = .125 (с/о) = .25 (о/о) = .5 
0 1 1 1 
.05 .999 .997 .987 
1 997 987 952 
.15 993 973 .894 
.2 .987 .952 .821 
.25 981 925 735 
3 973 .894 .641 
.35 963 .859 546 
4 2952 821 454 
45 939 179 .368 
5 925 135 .291 
‚55 911 .689 .225 
.6 .894 .641 .169 
.65 .878 594 124 
7 859 546 .089 
75 841 500 2062 
‚8 821 454 ‚042 
‚85 800 410 2028 
.9 179 .368 018 
.95 .757 .328 .012 


1.0 135 .291 .007 
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r(R) = exp (-В2/202) ,у 


Figure 3.18 A circular gaussian distribution. 


Some impulse responses which are often appropriately represented as 
gaussian are symmetrically aberrated optical impulse responses, random scan 
encoder position errors, sensor vibrations, and cathode ray tube spots. 
Schade?^ presents numerous practical gaussian-type cathode ray tube spread 
functions and their transforms. Detailed knowledge of this sort usually is not 
available from the CRT manufacturer, who instead may specify spot size in 
terms of the semi-subjective shrinking raster test in which the raster is con- 
tracted electronically until modulation reaches the threshold of perceptibility. 
If the spot isa symmetrical gaussian, the sigma is related?°»°° to the shrinking 
raster center line to center line separation s by 


о =0.54s. (3.37) 


If the manufacturer specifies the resolution in terms of the highest spatial 
frequency square wave input which is resolvable on the monitor the sigma is 
related to the bar pattern period p by 


о = 0.42 p. (3.38) 
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Jenness, etal.?? calculated the ripple which results from television 
raster by gaussian shaped scan lines. For line spacing h and spot standard 
deviation о, they found for h/o ratios of 0 to 4 that the ripple is sinusoidal. 
The percentage ripple versus h/o is shown in Figure 3.19. 
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Figure 3.19 Ripple for a raster constructed of gaussian spots versus the 
ratio of line spacing to spot sigma, adapted from reference 35. 


The third useful spread function is the circularly-symmetric pillbox of 
Figure 3.20 given by 


КЕ) = Circ (S. (3.39) 


The OTF is real and is given by 
т = 2 Bessinc (p fg) = 21, (прѓр)/(трѓр), (3.40) 


where fp is spatial frequency in any radial direction. This is shown in 
x-profile in Figure 3.21. The circular pillbox may describe a severely 
defocussed optic, a circular detector, or a circular display element. The 
Bessinc function is accurately approximated?? by 


2 Bessinc (p fg) = Sinc (0.867 p fp). (3.41) 


98 Chapter 3 


Figure 3.20 A circular function. 
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Figure 3.21 Truncated Bessinc function. 
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3.8 Optical Component OTF’s 


The majority of thermal imaging systems operate with broad spectral 
band-passes and detect radiation noncoherently, so partial coherence effects? 7 
are not observed. Therefore classical diffraction theory is adequate for 
analyzing the optics of noncoherent thermal images. The OTF of a 
diffraction-limited optic depends on the wavelength of the radiation and on 
the shape of the aperture. Specifically, the OTF is the autocorrelation! of 
the entrance pupil* function p(x,y) with entrance pupil coordinates x and y 
replaced by spatial frequency coordinates х/Х and y/X. 


OTF (f, = fy =3) = 


Шаа 


| | p (Еп) d£dn 


(3.42) 


For the simple case of plane wave monochromatic illumination on a 
clear rectangular aperture, the OTF is real and the autocorrelation simplifies! 
to an MTF, 


T=1- for <I, (3.43) 
C C 
where 
D 
Ё, --- ; (3.44) 


D, is the aperture width, and А is the wavelength. 


*If spatial frequencies are expressed in object plane coordinates. 
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For a clear circular aperture, 


т-2 f f Ба Бы i 
г= = јаз E-E | 1- (£) | | , for г < 1, (3.45) 


where f. = D,/A and D, is the entrance pupil diameter. The PTF for a diffrac- 
tion limited optical system is always zero. This MTF is tabulated in Table 3.3 
and shown in Figure 3.22. An approximate expression accurate to £0.01 when 
both the exact and the approximate expressions are rounded to two decimal 
places is 


T~ 1— 1218 (ғ) over 0 < 7 « 0.6. (3.46) 
MTF 
1.0 
8 
6 
4 
‚2 
92 2 4 6 8 1.0 


со 


Figure 3.22 MTF's of diffraction-limited circular and rectangular 
lens apertures with f = D/A. 


O’Neill! derived the MTF for the case of a circular diffraction-limited 
lens with a circular central obscuration. The resulting formulas are too com- 
plex to reproduce here, but several cases calculated by Viswanathan?* are 
shown in Figure 3.23. The lens diameter is По, the obscuration diameter is 
nD,, and the fractional obscuration is 1-n’. 
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Table 3.3 
MTE of an Unobscured Circular Aperture 


ДА МТЕ f/f, MTF 
0.00 1.0000 5 0.3910 
0.025 0.9682 525 3638 
05 9364 55 3368 
075 9046 575 3105 
10 8729 6 2848 
125 8413 625 2696 
15 8097 65 2351 
175 7783 675 2112 
2 7471 7 1881 
225 7160 725 1658 
25 6850 75 1443 
275 6543 775 1237 
3 6238 8 1041 
325 5936 825 0855 
35 5636 85 0681 
375 5340 875 0520 
4 5046 9 0374 
425 4756 925 0244 
45 4470 95 0133 
475 4188 975 0047 
1.00 0000 


Equation 3.45 is the diffraction-limited MTF for plane wave monochro- 
matic illumination, whereas the cases of real interest involve broadband 
illumination. If incoherence may be assumed, the polychromatic impulse 
response is the weighted sum of the component monochromatic impulse 
responses. For the special case of a uniformly weighted spectral distribution 
of plane wave illumination incident on a rectangular or circular aperture, the 
impulse response can be obtained analytically. The broadband incoherent 
impulse response is 
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Figure 3.23 MTF’s of centrally-obscured circular apertures versus obscuration 
diameter ratio, adapted from reference 38. 


Хо 
r(x) = || “ху ах, (3.47) 
Ay 


and the transform is 


Ay A2 
7(f,) =F Í (ху dab = || T(f,,d) dX. (3.48) 
Ài № 


For the 8 to 14 um and the 3 to 5 um bands the difference between the exact 
expressions resulting from the integration and the monochromatic expressions 
using the mean wavelength (A, + A, )/2 to replace А are insignificant. 

The impulse response of a non-diffraction-limited optical system is 
determined by wavefront errors such as the primary aberrations: defocussing, 
spherical aberration, chromatic aberration, coma, distortion, astigmatism, 
and field curvature; and by higher order aberrations. The OTF of an aberrated 
optical system is calculated by replacing! the real pupil function p(x,y) for 
the diffraction-limited case with a generalized complex pupil function 
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_ 2ті 
PG) = реку) exp |- =F wey) (3.49) 


where W(x,y) is called the aberration function. W(x,y) is the wavefront 
deviation in the exit pupil at coordinates (x,y) from an ideal unaberrated 
spherical converging wavefront. The OTF is found by substituting the com- 
plex pupil function for the unaberrated pupil function in the autocorrelation 
equation. 

Aberration functions W(x,y) for the various aberrations are derived as 
demonstrated by Klein??. Discussion of these wavefront aberrations and the 
complicated OTF formulas which result are beyond the scope of this book. 
The reader may consult a text such as Smith*®, Klein??, Barakat*!, ог 
O'Neill? for summaries of the aberrations, and references 1, 9, 10, 11, 13, 
14, and 15 for details. Slight aberrations can produce serious degradations 
in MTF accompanied by severe phase shifts. 

The aperture size іп ап 8-14 um system is usually determined jointly by 
resolution and thermal sensitivity requirements, and such a system typically 
is designed to be diffraction-limited. Systems in the 3 to 5 band, however, 
tend to have apertures sized for only thermal sensitivity, because the diffrac- 
tion limit is higher at shorter wavelengths. These systems tend to be simpler 
but non-diffraction limited. 


3.9 Electrical Processing OTF's 


The electrical filters used in detector signal processing differ from other 
thermal imaging system components in that their spread functions are single- 
sided due to the time causality condition in stable electrical circuits. 
Consequently, their time domain convolution integrals are single-sided and 
their impulse responses are Laplace transformed rather than Fourier trans- 
formed to yield the frequency response. Electrical filtering occurs in detector 
signal preamplifiers, in video processing amplifiers, and in MTF-boosting or 
aperture correction circuits. Also, detectors are in effect electrical filters 
because their response times are limited by such mechanisms as charge 
diffusion lengths, recombination rates, and thermal time constants. 

Electrical or temporal frequencies in Hertz are conveniently related to 
scan direction spatial frequencies in cycles/milliradian by multiplying by the 
ratio of the detector subtense о to the detector dwelltime та, 


f[Hertz] = 2 f [cy/mrad] . (3.50) 
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Subsequently, we will let f refer either to electrical frequency or to the 
equivalent spatial frequency, and let the appropriate units be implied by the 
context. 

Any single electrical filter or ensemble of cascaded filters may usually be 
approximated by cascaded combinations of three types of filters. These are 
the RC filters, so named because they are represented by circuits using ideal 
resistors and capacitors. These three types of RC filters are the high-pass, 
the all-pass lead, and the low-pass filter, and are shown in Figure 3.24 together 
with their characteristic frequencies fẹ and their transfer functions. Note 
that these OTF's are complex, indicating nonzero phase transfer functions 
and the presence of phase shift. The MTF of the single RC low-pass filter is 
tabulated in Table 3.4 and graphed in Figure 3.25. 

Electrical phase shifts from these components are linearly converted 
into displayed spatial phase shifts, and the spatial direction of the phase 
shift is determined by the scan direction. Thus if there is an electrical phase 
lag and the scan is from left to right, the shift on the display will be to 
the right. With unidirectional scan, moderate phase shift does not seriously 
degrade image quality. With bi-directional scan, however, phase shifts on 
successive frames are in opposite directions, and the full frame-to-frame 
phase shift in a stationary target is doubled when seen by a sensor (such as 
the eye) whose integration time is greater than the frame time. 

The consequence of this is a serious image blurring which we will now 
describe mathematically. Let OTF(f) be the complex transfer function of the 
electrical filter 


OTF = Re(OTF) + j (ОТЕ), (3.51) 


where Re and Im denote the real and the imaginary parts of the OTF. The 
bi-directional scan produces a multiframe-averaged real unnormalized trans- 
fer function 


T (f) = OTF (f) + OTF* (f) = 2 Re (ОТЕ), (3.52) 


where the asterisk denotes the complex conjugate. 
Expressed in polar form this is 


Т = ОТЕ exp (j arg OTF) + OTF exp Cj arg OTF) 
(3.53) 
= 2 OTF cos [arg (OTF)]. 


Normalized to a value of one, this is: 


1 = ОТЕ cos [arg (OTF)]. (3.54) 
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Table 3.4 
MTF of a Single-RC Low-pass Filter. 


f/f, [1H (f/f)? ]1 2 f/f, [1--(f/£,)? ] 1? 
0 1 2.2 .414 
.1 .995 2.3 .399 
.2 .981 2.4 .385 
3 .958 2.5 .371 
4 .928 2.6 .359 
5 .894 2.7 .347 
.6 .857 2.8 .336 
y .819 2.9 .326 
.8 .781 3.0 .316 
9 743 3.2 .298 
1.0 107 3.4 .282 
1.1 .673 3.6 .268 
1.2 · .640 3.8 .254 
1.3 610 4.0 243 
1.4 ‚581 4.5 .216 
1.5 .555 5.0 .196 
1.6 .530 5.5 .178 
1.7 .507 6.0 .164 
1.8 .486 7.0 .141 
1.9 .466 8.0 .124 
2.0 .447 9.0 .110 
2.1 .430 10.0 .100 
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f/f, 


Figure 3.25 Single RC low-pass filter MTF. 
Thus the time-averaged bi-directional phase shift results in a real OTF. 
Since the argument of a complicated filter OTF is the sum of the component 


OTF arguments, the MTF loss due to phase shift alone in bi-directional 
scanning with phase shift is 


T = cos E arg (оте) | . (3.55) 


1 


For the single-RC low pass filter, the complex OTF is 

ОТЕ = (I-Hjf2rRC)" (3.56) 
and the normalized unidirectional MTF Tis 

T= [1 + (2rf RC? |]! 7. (3.57) 


With bi-directional scanning, the MTF is degraded to 


T=[1+ (Сл fRC) ]"!. (3.58) 
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This bi-directional scan-induced MTF loss is more serious than it may 
appear at first glance. Consider again the single-RC low-pass filter. This 
filter has a PTF of arctan (-f/f.), which at low frequencies is approximately 
-f/f., that is, which is linear with frequency. Now this linear frequency- 
dependent phase shift is simply a time lag and is displayed as a translation 
in the scan direction, so combined with a bi-directional scan the imagery 
has a slightly “doubled” appearance. For example, a slit input produces 
the image shown in Figure 3.26. Thus, even a slight phase shift may cause 
objectionable image degradation. This effect, and the edge flicker effect 
discussed in Section 4.6.3, strongly militate against bi-directional scanning. 


Figure 3.26 Image doubling due to bi-directional scan. 


The three filters of Figure 3.24 can be used individually or in combina- 
tion to describe most realizable filters. Many detectors, for example, behave 
like single-RC low-pass filters with responsive time constants of 


тр = 1/(2mRC). (3.59) 


The simplest ac coupling device is the single-RC high pass filter, and many 
preamplifiers behave as single- or double-RC low pass filters. Many boosting 
circuits may be approximated by an all-pass lead filter followed by double-RC 
low-pass filter. If these simple circuits are inadequate, equivalent RLC 
networks can often be devised. 


3.10 Image Motion and Image Position Uncertainty MTF Losses 


In addition to the sources of resolution loss discussed in Section 3.5 
which are invariably present in thermal systems, there are four other 
mechanisms which must be dealt with. These are sensor vibration relative to 
the scene, display vibration relative to the observer, uncertainly in the scan 
(read in) position at a point in time, and uncertainly in the display (write out) 
position in time. Motion of the sensor relative to the scene occurs when the 
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sensor is inadequately stabilized to compensate for platform vibrations and 
wind buffeting. Display vibration relative to the observer occurs when the 
platform shakes at rates and amplitudes such that the observer’s body and 
head move more than his visual system can compensate for. Uncertainty 
in the read-in position occurs when there is uncertainty in the scan mirror 
angle from frame to frame at a given nominal angular position. Write-out 
position errors occur when the scan position encoder introduces uncertainty 
in the scan position signal. 

Random motions which are periodic with frequencies which do not 
exceed about 2 cycles per second do not cause serious image blurring because 
the eye’s integration time is too short to fuse such motions. Since the eye’s 
ability to extract information in the presence of image motion is not well 
understood, it is not possible to associate a given motion with an effective 
OTF loss unequivocally. However, it seems likely that purely random motions 
or position errors may be described by the Fourier transform of the probability 
density function of the image position when the density function is con- 
tinuous. For example, a scan position pickoff or synchronization error which 
has a gaussian probability density centered on the correct value of scan 
position should have an effective an MTF which is gaussian with a sigma 
the same as that of the density function. 


3.11 Equivalent Bandwidth, Equivalent Resolution, and the Central 
Limit Theorem 


Schade? discovered that the apparent image sharpness of a television 
picture tends to be describable by the integral over frequency of the square 
of the television system's MTF. Schade called this integral the equivalent 
line number N, [cycles/picture width], 


оо 


Ne Д | [7(D]? df. (3.60) 
0 


This quantity is in effect an equivalent square bandwidth, as shown in 
Figure 3.27. N, is one of the best measures of image sharpness and resolution 
in non-noisy imagery, and its significance is discussed in detail in Section 4.10. 

If М. is a good measure of resolution in the frequency domain, then 
there should exist a related parameter in the spatial domain which we could 
call equivalent resolution. Sendall^? examined all of the reasonable potential 
definitions of an equivalent resolution r, and concluded that the best one is 
T= 1/2Ng, since this gives т = а for a rectangular spread function. Figure 3.28 
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Figure 3.27 An example of equivalent bandwidth 
for a Gaussian MTF. 


LSF MTF Ne T 
Rect (x/a) Sinc (af) 1/20 а 
eX’ /20* е-2т° of? 1/40 Мт = .142/0 0/.284 
a?/(x? + a?) е'2а | 1/2a a 


Figure 3.28 


Examples of equivalent bandwidths and resolutions. 
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gives № and т for some common OTF’s. This definition for t has the dis- 
advantage that ris not necessarily the angular dimension of the smallest detail 
resolvable by the system. 

The well known central limit theorem of probability and statistics has 
an analog? in linear filter theory, which is that the product of N bandlimited 
continuous MTF's tends to a gaussian form as N becomes large*. Many 
thermal imaging systems have at least four component MTF's, and often as 
many as seven, so the system line spread function can often be adequately 
approximated by r(x) = exp (-x?/20?), where o is the standard deviation. 
The corresponding MTF is: 


КЕ) = exp C21? 0? f,?). (3.61) 


Asan example, Table 3.5 and Figure 3.29 show component MTF's which 
might be used with the one milliradian example system and whose product 
is a system MTF which is approximately gaussian. Thus it is worthwhile for 
analytical and specification purposes to note some properties of this function. 
The standard deviation of a spread function may be estimated by finding the 
points r(o) = .61 or r(1.175 о) = .5. The scale of an MTF may be estimated 
by finding the frequency f= .159/o at which T= .61. Various estimates of 
spread function size in terms of the percentage of total spread function 
energy P between x = -х and x = x, may be converted to a gaussian spread 
function from the following values: 


P Xo 
383 0/2 
.683 o 
842 20 
955 20 
997 30. 


Another way to approximate an MIF by a gaussian is to find the o 
which makes the gaussian’s N, of 0.141/o equal to the N, of the functionally 
unknown MTF. Three universal gaussian curves are shown in Figure 3.30 as 
functions of the dimensionless parameter Kf for o's of 0.4 K, 0.5 K, and 
0.6 K. These curves may be used to see the effect of o on the MTF of a 


*A rigorous statement of the central limit theorem is given in reference 43. 
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system by choosing a value for K. AK of 0.25 mrad gives a nominal 0.25 mrad 
system with a cutoff near 4 cy/mrad, and the figure then shows MTF's for 
o’s of 0.1, 0.125, and 0.15 mrad. 


Table 3.5 
MTF's for the Example System. 


a = 1.0 mrad, f, = 1 cy/mrad 
Dg = 2 cm, f, = 2 cy/mrad at л = 10 um 


electronics characteristic frequency = 0.5 cy/mrad 


бт. as 
optics detector electronics monitor system gaussian MTF 
f cy/mrad To Td Telec tm Ts with о = .55a 
0 1 1 1 1 1 1 
Л 9634 984 981 987 918 942 
2 .8729 .935 .928 .952 ‚721 .788 
3 8097 858 „857 894 „532 584 
4 7471 157 ‚781 ‚821 .363 .385 
5 .6850 .637 .707 .735 227 .224 
.6 6238 505 .640 .641 .129 117 
7 5636 .368 583 546 .066 .054 
8 .5046 234 530 „454 .028 .022 
9 .4470 .109 486 .368 .009 .008 
1.0 3910 0 447 ‚291 0 .003 
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Figure 3-30 Example gaussian MTF's. 
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CHAPTER FOUR — VISUAL PSYCHOPHYSICS 


4.1 Introduction 


The problem to be addressed in this chapter is the following: how much 
can the electro-optical image deviate from "reality" before the human visual 
system becomes disoriented and cannot efficiently extract information from 
it? This question cannot be answered unambiguously today, but to approach 
the answer we will consider the following visual processes and effects: 


€ resolving power and spatial frequency response; 

perception of noise; 

image magnification; 

discrimination of signals from noise and/or background; 
disagreeability of effects such as too large or small field of view, 
too fast scene motion, raster, and flicker. 


Some of these effects are well understood, but a frequent problem is 
that the understanding is usually in the terminology of the field of visual 
psychophysics, so that it must be translated into the imaging systems view- 
point. On the other hand, some of the significant and original research on 
vision has been and continues to be conducted by engineers seeking solu- 
tions to practical imaging problems. Before considering the specifics of 
visual psychophysics, a brief summary of general visual system features is 
in order. 


4.2 General Characteristics of the Human Visual System 


We will consider here only those characteristics of the eye which are 
important for thermal imaging system design. The reader interested in a 
broader treatment should consult Gregory! 7, Cornsweet?, Davson^, Fry?, 
or Schade?. The eye has a moderate quality field of view of approximately 
30 degrees in elevation and 40 degrees in azimuth, a high quality circular 
field called the foveal field of about 9 degrees, and a best vision field of about 
1 to 2 degrees in diameter. The 30 by 40 degree field is used in most visual 
tasks, and it is for this reason that sensors with 4:3 aspect ratio fields (such 
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as commercial television) are adjudged more aesthetically pleasing than simi- 
lar sensors with other aspect ratios’. Color vision extends over about 
90 degrees of the central field, with color sensitivity decreasing as the retinal 
edge is approached. The very rim of the retina does not produce a conscious 
sensation of vision, and is used only for motion detection. A normal youthful 
eye can focus on objects as close as about 25 centimeters, but this minimum 
adaptation distance increases with age. 

The luminance range within which the eye can usefully operate extends 
over about nine decades, and it is sensitive over about ten decades. The eye’s 
sensitive range is divided into three somewhat indistinct luminance regions 
called the scotopic, the mesopic, and the photopic ranges. The scotopic 
range extends from the absolute visual luminance threshold of about 
10°° foot-Lambert (fL) up to about 1074 fL; the mesopic range from around 
10^ fL to 1 fL; and the photopic from about 1 fL up to 104 fL. These 
three regions correspond roughly to the shift from rod or extrafoveal vision 
at extremely low levels, to a combination of rod and cone vision at twilight 
levels, to cone or foveal vision at the highest levels. 

The resolution capabilities of the eye are describable several different 
ways. Perhaps the simplest measure of visual resolution is the visual acuity, 
the reciprocal of the smallest angular detail in arcminutes resolvable by the 
eye. The maximum value of visual acuity obtained depends on the definition 
of what constitutes the smallest resolvable angular detail, but nominally it 
has.a value of one reciprocal arcminute. Visual acuity decreases as the posi- 
tion of the target moves away from the line of sight. 

The resolution capabilities of the eye are controlled to some extent by 
the pupil diameter. When light levels are high and the eye does not need all 
of the available energy to function properly, the pupil is constricted. This 
blocks off-axis rays which produce aberrations, thereby improving resolution 
in bright light. At normal levels the pupil diameter is about 3 mm, and the 
range of variation with light level is about 2 mm to 8 mm. 

It is important to understand how the visual system works so that sub- 
jective impressions of image quality may be translated into measurable 
objective correlates. There are four distinct subjective impressions of quality. 
These are the perceptions of sharpness, noisiness, contrast, and spurious 
image structure. Sharpness is the term used to describe the overall detail 
rendition of a system and generally describes the ability of an observer to 
extract information from quiet, high contrast, interference-free imagery. The 
terms resolution and resolving power are often used synonymously with 
sharpness, although they denote slightly different qualities. Noisiness refers 
to the degree of masking of signals by fixed-pattern or time-varying noise. 
Other common terms for image noise are granularity and ''snow". Contrast 
describes the brightness of interesting details relative to the brightness of 
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background details. Spurious image structure includes such artifacts as flicker 
due to perception of framing action, aliasing due to too low a spatial or 
temporal sampling rate, inter-line flicker in interlaced systems, line crawl, 
and optical distortion. 


4.3 Spatial Frequency Response 


One of the best understood and most investigated aspects of visual 
psychophysics is the sine wave response of the eye. For analytical conve- 
nience, it is desirable to treat vision as a linear process, thereby utilizing the 
power of linear filter theory and the concept of the optical transfer function. 
However, visual processes are not always linear, as in the case of the Mach 
phenomenon.* It is nevertheless useful to approximate the behavior as linear 
for low image contrasts, but to emphasize this approximation we will refer to 
the eye’s sine wave response (SWR) rather than to an eye OTF. Sine wave 
response is defined whether an operation is linear or not, and in the non- 
linear case it is multi-valued depending on light level and other parameters. 

The sine wave response is determined by at least eight processes: 


Diffraction at the pupil; 
Aberrations in the lens; 

Finite retinal receptor size; 
Defocussing; 

Ocular tremor; 

Retinal receptor interconnections; 
Optic nerve transmission; 
Interpretation in the brain. 


The overall SWR of the eye may be measured by the technique of 
liminal (threshold) recognition of sine wave patterns presented to the eye, or 
by a suprathreshold technique of comparing the peaks and troughs of the 
perceived sine wave to adjustable calibrated sources. With the liminal 
method, the contrast at which a sine wave is resolvable with a specified level 
of confidence is plotted versus spatial frequency, as shown in Figure 4.1. The 
assumption is made that the variation with spatial frequency is due to spatial 
filtering and that the threshold contrast equals a minimum value CMIN 
divided by an eye MTF. This MTF is obtained by dividing the curve by Cyn 
and inverting the values, as shown in Figure 4.2 


*The overshoot and undershoot perceived at the transition between light and dark at a sharp high 
contrast edge target. 
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VIEWING DISTANCE = 10 FT. 
BACKGROUND LUMINANCE = 20 fL. 


0 1 2 3 
SPATIAL FREQUENCY [cy/mrad] 


Figure 4.1 A measured sine wave threshold contrast curve, 
adapted from reference 19. 


SWR 


0 1 2 3 
SPATIAL FREQUENCY [cy/mrad] 


Figure 4.2 The calculated eye SWR for the contrast thresholds of Figure 4.1. 
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The problems with these kinds of measurements are fairly obvious. A 
threshold measurement is necessarily made at a low signal-to-noise ratio, so 
the visibility of sine waves is related not only to the signals in the eye-brain 
system, but also to the sources of noise in the imaging system and in the 
brain’s decision-making processes. Also, the threshold results are not neces- 
sarily valid for high contrast imagery, whereas the suprathreshold measure- 
ments necessarily reflect the influence of the nonlinear Mach phenomenon. 
Even though the visual system is demonstrably nonlinear, nonstationary, and 
nondeterministic, the concept of an eye SWR is still very useful whenever we 
are able to use a small signal analysis. 

The eye’s sine wave response has been measured under a variety of con- 
ditions, and the following parameters have been found to have a significant 
effect on the SWR: 


€ Adaptation (mean background) luminance; 
Viewing distance at constant magnification; 
Target angular orientation; 

Target color; 

Exposure time; 

Temporal frequency of presentation. 


The following six basic conclusions may be drawn about the eye SWR: 


l. The observed spatial frequency response of the eye is not the re- 
sult of a single spatial filter, but is rather the envelope of multiple 
narrowband tuned filters. Each of these filters is tuned to a differ- 
ent center frequency and is stochastically independent of the 
others. Each is independently selectable and the brain constructs 
an approximate optimum filter for a given detection task by appro- 

. priate selection of filters from the available filter ensemble. 

2. This filter ensemble has an envelope of the general shape shown in 
Figure 4.3 for a single set of conditions. In general, the response 
is poor near dc, rises sharply to a maximum at low frequencies, 
and falls off in approximately gaussian fashion at high frequencies. 
Depending on the individual, the double peak shown in Figure 4.3 
may be pronounced, barely noticeable, or absent altogether. 

3. The spatial frequency at which the peak occurs shifts higher as the 
average adaptation luminance increases, and the overall SWR covers 
a broader frequency range, as shown in Figure 4.4. For the lumi- 
nance range ordinarily encountered with an electro-optical display, 
the peak occurs in the neighborhood of 0.1 to 0.4 cy/mrad. 
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Figure 4.3 An eye sine wave response function showing double peaking behavior, 
measured using a CRT viewed at 39 inches, adapted from reference 22. 
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Figure 4.4 An eye sine wave response demonstrating the effect of average 
target luminance, measured using a CRT, adapted from reference 6. 
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4. At constant magnification, the peak frequency shifts higher in 
frequency as the accommodation distance increases. Also, the SWR 
broadens with increasing distance due to the lesser eye lens curva- 
ture and the rejection of off-axis rays. 

5. The threshold contrast at which sine waves are detectable de- 
creases as adaptation luminance and adaptation distance increase. 

6. The contrast threshold increases as the angle that the sine wave 
chart makes with either the horizontal or the vertical approaches 
45 degrees. 


The first direct measurements of eye SWR were made by Schade? in 
1956 using the threshold technique. Many other investigators have varied 
Schade’s basic experiment and most of them verified Schade’s general 
findings. The most significant deviation occurred when Campbell and 
Robson® compared the contrast thresholds for sine wave, square wave, 
rectangular, and sawtooth spatial frequency patterns, and concluded from a 
harmonic analysis that multiple narrowband tuned filters are present. Then 
Sachs, et а1.7, demonstrated more directly that the spatial frequency response 
of the eye represents not a single filter, but is rather the envelope or sum of 
several parallel channels, each of which is a narrowband tuned filter. Further- 
more, they showed that the noise statistics of these channels are independent 
of each other. The measurements of Stromeyer and Julesz!? also support 
this theory. 

Most eye SWR experiments have been performed using the threshold 
method, but useful visual tasks are not always performed at threshold. Thus 
the research on suprathreshold SWR by Watanabe, et al.!! , is particularly 
interesting. They made suprathreshold (as well as threshold) measurements 
by the comparison method and observed that the higher above threshold the 
pattern becomes, the wider the frequency response is. They suggest that their 
SWR data are well fitted by the following formulas: 


low frequency liminal region: 


between SWR = 1-[1 + (f/£f,) ] ! ? (4.1) 

and SWR = 1-[1 (УІІ; (4.2) 
high frequency liminal region: 

SWR = [1 + (И) ) ]? : (4.3) 
high frequency suprathreshold region: 

SWR = e(f/f2)’ - (4.4) 


where f,, f,, and f, are characteristic frequencies. 
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Some good summaries of the eye’s sine wave response characteristics 
are given by Ету! 2, Fiorentini! 3, Levi! ^, Westheimer! ? , and Ostrovskaya! 6. 
The latter reference includes results of some studies not available in English. 
Figure 4.5 shows the range of variation in the results of those experiments 
using natural pupils from the experiments reported in references 6 and 8 
through 28. 


SPATIAL FREQUENCY [cy/mrad] 


Figure 4.5 Range of variation of measured sine wave responses, 
adapted from references 6 and 8 through 28. 


In general, the SWR as a function of light level seems to be explainable 
as follows: at very low levels, the pupil is wide and aberrations dominate; at 
moderate levels, such as 20 fL, the retina and brain dominate; and at very 
high levels with narrow pupils, diffraction dominates. The results of the fol- 
lowing investigators are reasonably consistent with each other and are ap- 
plicable to electro-optical imaging problems: Schade$ , Lowry and DePalma’ ?, 
Campbell?5, Gilbert and Fender? $, Pollehn and Roehrig?", Kulikowski^*, 
Robson!?, and Watanabe, et al.!!. Peaking behavior is not observed below 
about 5 fL and the highest resolved frequency reported is 3.44 cy/mrad. 

Understanding the filtering properties of the visual system is important 
for two reasons. First, the eye itself approximates an optimum filter, so 
trying to improve the eye's performance by prefiltering the image spectrum 
is a questionable procedure. Second, the efficient coupling of the final image 
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to the eye depends in large measure on the overall system magnification. 
Obviously, one should not present information at 4 cy/mrad to an eye limited 
to 2 cy/mrad or less. This coupling problem and the important choice of an 
efficient magnification are the subjects of the following discussion. 

Schade® observed that the eye sine wave response is approximately 
gaussian above the peak frequency, so that 


SWR~ exp C27? оё f?) (4.5) 


where o, is the standard deviation of the equivalent line spread function as 
defined in Section 3.14. Consider the gaussian MTF's shown in Figure 4.6 
for sigmas of 0.15, 0.2 and 0.25 mrad. If we compare these gaussian MTF’s 
with the measured eye responses, we see that most of the results of interest 
are bounded by the о = 0.2 and the o = 0.3 mrad curves. Alternatively, one 
could use the form 


SWR ~ [1 + (f/f, )? ]? (4.6) 


suggested by Watanabe,et al.!! Some equations of this form are graphed in 
Figure 4.7. 


mrad 


= .15 
= .20 
= .25 
= .З 


SPATIAL FREQUENCY [cy/mrad] SPATIAL FREQUENCY [cy/mrad] 


Figure 4.6 Sine wave responses given by Figure 4.7 Sine wave responses given by 
exp І-2л? оь2 f?]. [1 + (f/f, )?] ? 
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For analytical purposes, we shall assume following Schade that: 
2 
SWR = e2"^ Ое f? tor f > 0.2 cy/mrad. (4.7) 


4.4 Magnification Effects 


Schade suggested that the angular magnification M of a device be chosen 
so the frequency of greatest interest coincides with the peak frequency of 
SWR. This is an approximate way to match the eye and the system MTF's 
together to achieve the best overall MIF. We may carry this one step further 
by asking what the value of M is which optimizes the equivalent bandwidth 
N'of the eye-system combination. A useful system MTF, and one particularly 
amenable to analysis, is the gaussian form. The effect of increasing magnifica- 
tion is to widen the eye’s effective SWR with respect to the system’s MTF. 
If we let the system sigma be o, we may write 


N'- [SWRI R? ( af 
2M 


- | exp [^r f? (55) + б Jat (4.8) 
.2M 
Here we are referencing all frequencies to object plane spatial fre- 
quencies, so that N’ may be compared with the original system bandwidth 
N, before the eye is included. By a variable of integration change, equation 
4.8 becomes 


со 


М 
N= 2 eM зү || e dx | (49) 
+ 
тое Os | 0.4M[o,” + M?o,?]!? 


Normalizing to N, = 


40, ут 


оо 


N. ___ Ms ___ eX dx | (4.10) 
М; /т [06 + M? 021. 0.4M [067 + M2 042112 


The parameter N/N; is plotted as a function of Mo, on Figure 4.8 for 
бе = 0.25 mrad. 
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OM 


Figure 4.8 Eye-system equivalent bandwidth normalized to the system bandwidth 
and plotted as a function of the product of magnification and system sigma. 


There are several conclusions which may be drawn from this function. 
First, the optimum magnification is 1/(40,), there is broad optimal range, 
and the 3 dB-down points are separated by a magnification change of ap- 
proximately 5X. Second, the N’ at 1/(40,) is slightly less than half of the 
N, of the system, 0.141/o,. Third, it is clear that over- and under-magnification 
can result in severe loss of overall resolving power. As an example, we may 
consider a system with o, = 0.125 mrad, nominally а “0.25 mrad” system. 
The optimum magnification in this case is 2.0, and a range of 0.8 to 4.0 
would probably give good results. 

Bennett, et al.?? , in the course of photointerpreter performance studies, 
found that magnifying one resolution element beyond 3 arcminutes 
(0.873 mrad) at the eye resulted in degraded performance. If we interpret 
this loosely, we may say that the system cutoff frequency divided by M 
should not be less than about 1.1 cy/mrad. Thus, a 4 cy/mrad system would 
need at most a magnification of 3.6, which is consistent with the previous 
discussion. 


4.5 Visual Sensitivity to the Spatial Frequency of Random Noise 


Random noise interferes with the proper functioning of a thermal 
imaging system in two ways. First, random noise interferes with an observer’s 
ability to detect, recognize, and identify targets on a display. Second, certain 
kinds of noise are so objectionable by themselves that an observer may not 
look at the display at all. Therefore it is important to know how noise is 
perceived. 
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The first investigators to report that the eye perceives different spatial 
frequency distributions of noise differently were Mertz?? and Baldwin?!. 
Baldwin subjectively measured the visibility of various spectral types of tele- 
vision noise, and Mertz deduced from these measurements the eye’s noise 
weighting function, or sensitivity to the spatial frequency content of random 
noise. Baldwin observed the peculiar effect that perception of white noise 
with a sharp cutoff frequency f, is independent of f,, apparently because the 
masking effect of higher frequency noise offsets the increased noise amplitude 
due to the widened bandwidth. Coltman and Anderson?” reverified this 
observation by measuring the detectability of noisy television patterns. They 
summarized their findings this way: “the masking effect of white noise depends 
only on the noise power per unit bandwidth, and is independent of the upper 
frequency limit of the noise spectrum, provided that this exceeds the frequency 
limit set by the eye," That frequency limit is approximately 0.3 cy/mrad. 

Barstow and Christopher?3>3* investigated noise visibility using a 
6 by 8 inch television display viewed at 24 inches against a 0.0025 fL back- 
ground with noise present in grey areas having luminances of from 0.3 to 
0.6 fL. They found that noise power within any narrow spectrum is more 
objectionable than the same amount of power in a broader spectral band 
with the same center frequency. Also, low frequency narrow band noise is 
more objectionable than the same noise power in a higher frequency band. 

Barstow and Christopher gave vivid descriptions of the appearances of 
various types of noise. They observed that very low-frequency noise appears 
streaky and annoying. When the frequency of the noise is increased the noise 
has the appearance of fine-grained photographic noise in random motion 
with the apparent grain size decreasing as the frequency increases. They also 
observed that narrow band noise resembles a herringbone pattern. By com- 
paring the interference of narrow bands of noise, they found the eye’s sen- 
sitivity to television noise for 1962 vintage television. These results are shown 
in Figure 4.9. Brainard, et al.?? conducted similar experiments with 225 line, 
30 fps imagery. 

Brainard? conducted a noise weighting function measurement using a 
60fps 160-line TV with a 4.5 inch square picture viewed at 32 inches. 
Brainard detected a fine structure superimposed on the behavior observed by 
Mertz, by Barstow and Christopher, and by Brainard, et al. He found that 
the eye is more sensitive to narrowband noise centered on line frequency 
multiples than to noises between line frequency multiples. This effect will be 
Observed in television-type scanning systems only. An interesting possible 
application of this to FLIR systems is the following. Noise at line-frequency 
multiples appears as a varying level from line to line which is approximately 
constant across one line. The same effect will result if the dc level or the ac 
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Figure 4.9 The noise perceptibility data of Barstow and Christopher, 
adapted from reference 34. 


gain from line to line varies, as it tends to do in systems where each line is 
created by a different detector. The objectionability of such line-to-line noise 
is evident from Brainard’s data showing increased eye sensitivity to these 
noises. 

One of the most interesting and comprehensive studies of the effects of 
still noise was performed by Huang?'. Huang generated three still mono- 
chrome pictures (a face, a cameraman, and a crowd), and added independent 
additive rectangular low-pass gaussian noise in both the horizontal and the 
vertical directions. By controlling the bandwidths in each direction inde- 
pendently, he was able to generate “‘isopreference curves" which indicate the 
various combinations of vertical and horizontal noise which give equivalent 
image degradation. Using 3.5 by 3.5 inch photographs with signal-to-noise 
ratios of 26, 22, and 18 dB (19.95, 12.59, 7.94 linear signal-to-noise ratios) 
and with horizontal and vertical noise bandwidths N, and N, ranging from 
0.076 to 1.908 cy/mrad, observers rank-ordered the imagery under 30 foot- 
candle illumination. Figures 4.10 and 4.11 show the differences between the 
isopreference curves for a moderately detailed picture (a cameraman) and a 
highly detailed picture (a crowd). Huang found from data reduction that the 
noise weighting function has the same type of behavior as the sine wave 
response. 
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Figure 4.10 Noise isopreference curves for a moderately detailed 
picture (a camerman), adapted from reference 37. 
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Figure 4.11 Noise isopreference curves for a highly detailed 
picture (a crowd), adapted from reference 37. 
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Huang reached three basic conclusions: 


1. Fixed power, square bandwidth noise is most objectionable when 
the bandwidth is approximately 0.70 cy/mrad, but is decreasingly 
objectionable as the bandwidth decreases or increases. Thus if the 
noise power is kept constant and bandwidth is increased equally in 
both directions, the objectionability increases, reaches a maximum 
at 0.7 cy/mrad, and then decreases. 

2. | Noise with horizontal streaks is less objectionable than noise with 
vertical streaks.* 

3. Noises whose frequency content is similar to the signal frequency 
content of the picture are less objectionable than other noises. 


4.6 Spatial and Temporal Integration by the Eye 


It is well known that a large target can be perceived at a threshold 
luminance lower than the threshold for a smaller target, and that the thresh- 
old luminance of a simple target is inversely proportional to the square root 
of the target area within certain limits. This phenomenon may be inter- 
preted as a spatial integration of signal and noise luminances within the 
target area, and is called spatial integration or areal summation. Using simple 
rectangular or square targets, Coltman and Anderson??, Schade?5, and 
Rosell?? have proven that for television imagery, the perceived signal-to-noise 
ratio is proportional to the square root of the ratio of the target area to the 
noise coherence area in the image. By perceived signal to noise ratio we 
mean the following. Since the point SNR does not change significantly as 
the target size is increased, the fact that the eye detects broader signals more 
easily suggests that the eye operates on a perceived SNR for the total target 
rather than the point SNR. In the case of a FLIR, noise can usually be 
assumed to be coherent over a dimension on the display equivalent to the 
detector angular subtense. 

The limits of spatial integration were determined by Bagrash, et al.*°. 
They used squares, circles, and compound shapes, and found that areal 
summation ceases for areas of approximately 11.6 mrad in diameter or 
greater. More interestingly, their results with compound stimuli strongly 
support the parallel tuned-filter approach to the eye SWR discussed earlier, 
suggesting that areal summation is actually a selective spatial frequency fil- 
tering effect. 


*Since a raster is in effect a fixed-pattern streaky noise, this may be one of the reasons that the 
horizontal raster orientation of commercial TV is preferable to the alternative vertical raster. 
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Visual persistence, or the capacity of the eye to retain an image for a 
brief time, is a common experience. We are all familiar with the blurring of a 
rapidly moving object and with the persistence of a flash of light. Many ex- 
periments to determine the threshold of visibility of pulsed optical signals 
have indicated that the eye retains or stores a signal for a finite time, and 
sums signals slightly separated in time. Experiments with live noisy electro- 
optical imagery have verified the existence of this temporal integration prop- 
erty and further indicated that the eye does not perceive the instantaneous 
value of random noise but rather its root-mean-square value taken over a 
finite period. This behavior is critical to the satisfactory performance of a 
framing sensor, and we will devote this section to the available experimental 
results. Most of these experiments demonstrate that it is reasonable to asso- 
ciate an effective eye integration time T, with the phenomenon of temporal 
integration. 

An early expression of signal integration in time by the eye is the 
Blondel-Rey law. Glasford? expressed this by: 


L= Lo (1 +921) | (4.11) 
to 

where L is the threshold luminance for a source exposed for t, seconds, and 

Loo 15 the threshold luminance for a source exposed for an infinite time. 

The value 0.21 may be interpreted as the summation time T,. 

In 1943, DeVries*! reviewed existing data and concluded that a suitable 
eye summation or integration time T, is 0.2 second. Rose^? compared the 
visual impression of noise on a kinescope with photographs of kinescope 
noise taken with varying exposures. He concluded that a good match occurred 
for an exposure of 0.25 second. Coltman and Anderson?? recommended а 
Те of 0.2 second, and the data of Rosell and Willson?? are consistent with 
this value. 

Using the data of others, Schade? deduced that T, is 0.2 at low lumin- 
ances, and decreases at high luminances to 0.1 second for rod vision and 
0.05 second for cone vision. Luxenberg and Kuehn*? recommend 0.1 second 
for display applications. Blackwell^^ investigated visual thresholds for two 
noiseless pulses of 0.0025 second duration separated by 0.004 to 
0.475 second. He found perfect summation of the pulses for separations of 
less than 0.02 second, with decreasing summation up to 0.5 second. 
Budrikis*> analyzed the data of Graham and Margaria*® who measured 
thresholds over a wide exposure range. Budrikis found that their data were 
fitted well by assuming an exponentially decaying impulse response et/to, 
with t, = 0.135 second. 
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Tittarelli and Marriott^" double-flashed a one arcminute disc with a 
flash duration of 1.4 millisecond and a flash separation of from 20 to 
140 milliseconds. They measured the threshold of resolvability of the two 
flashes and concluded that for cone vision an optical disturbance decays 
with a half life of approximately 20 milliseconds. For pairs of flashes sep- 
arated by more than 80 msec, no significant improvement in the threshold 
was noted, indicating that temporal integration is not effective on stimuli 
separated by more than 80 msec. One might be tempted to conclude from 
this that systems with frame rates of less than 12.5 fps (i.e., 1/(80 msec)) do 
not benefit from temporal integration. These results are somewhat at odds 
with other studies which demonstrated effective integration times of 0.1 to 
0.2 second. 

The conclusion which can be drawn from these studies is that in a fast- 
framing system, if the noise is uncorrelated in space from frame to frame 
while the signal remains constant, the perceived image resulting from the 
eye's combining of several frames is less noisy than a single-frame image. The 
author's experience is that measurements of observer responses to thermal 
imagery are consistent with a T, of 0.2 second. That value is used in the rest 
of this text. 

The concept of temporal integration is important when we consider 
the choice of a suitable information update rate, or frame rate, for a system. 
Since the eye appears to integrate signal and to rms uncorrelated noise 
within its integration time, if the noise power is constant from frame to 
frame and the image is stationary, an image signal-to-noise ratio improvement 
by the square root of the number of independent samples (frames) occurs. 
This effect allows us to consider trading off the observable flicker, the per- 
frame signal to noise ratio, and the frame rate in order to achieve a desired 
perceived signal to noise ratio with low flicker. 

For example, if we contrive to make the rms displayed noise amplitude 
per frame at a point remain constant while we increase the frame rate F, the 
perceived SNR improves as /FT,*. On the other hand, if the noise ampli- 
tude per frame is somehow proportional to ЕР, as іп a system where noise 
power per root cycle/milliradian is proportional to frame rate, increasing Е 
does not change the perceived SNR and F should be increased until it exceeds 
the critical flicker frequency. 


*Realistically, as the frame rate and the number of integrated independent samples approach infinity 
while the per frame noise remains constant, the contrast limited case is achieved where sensor noise 
is insignificant. 
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4.7 Perception of Framing Action 


Some of the most objectionable image defects are those associated 
with image dissection by a framing raster scanner. Ideally, a framing system 
should take advantage of the temporal and areal summation of the eye to 
produce the subjective impression that the image formed is continuous in 
space and time. The worst defect occurs when the eye can sense that the 
image is discontinuous in time. This particular perception of the framing 
action is called the flicker phenomenon, or simply flicker. 


4.7.1 Flicker 


Flicker is a complex effect whose properties vary among observers and 
are strong functions of the display characteristics. Understanding flicker is 
important to visual psychophysicists because it is indicative of the nature of 
signal processing mechanisms of the brain. Flicker has therefore been widely 
investigated in a nonelectro-optical context solely for its psychophysical 
interest. The perception of flicker is apparently a probabilistic function with 
a very narrow transition from visibility to invisibility. As the frame rate of an 
image increases, the fluctuations become indiscriminable at a well-defined 
threshold frequency called the critical flicker-fusion frequency CFF. 

Flicker must be suppressed in an imaging system because at best it is 
distracting and annoying, and at worst it may cause headaches, eye fatigue, 
and nausea. It is particularly important to avoid flicker rates in the alpha 
rhythm range of approximately 3 to 10 Hz because they may trigger convul- 
sions, or cause photic driving (stimulation of sympathetic brain rhythms) 
with unknown consequences. Obviously, flicker is a serious defect which 
must not be allowed to compromise system and operator performance. 

Flicker in an imaging display was first investigated by Engstrom*®, a 
television pioneer who varied the duty cycle, frame rate, and screen illumina- 
tion of a rectangularly-pulsed blank display. This white light optical display 
was 12 X 16 inches and was viewed at 60 inches in an ambient of about 
1 foot-candle. The display was blank. Engstrom*® found that the CFF 
depends on the angular size of the flickering display, the adaptation lumi- 
nance of the display, and the luminance variation waveform. The data shown 
in Figure 4.12 demonstrate that over a limited range, CFF is proportional 
to the logarithm of the peak luminance. Engstrom repeated the experiment 
on a kinescope with significant phosphor storage to demonstrate the effect of 
integration, and Schade® extrapolated Engstrom’s data for other values of 
phosphor exponential decay. 


The apparent luminance of a flickering field above the CFF is described 
by the Talbot-Plateau law discussed by Luxenberg and Kuehn*? and by 
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Figure 4.12 Flicker frequency as a function of screen illumination 
and duty cycle, adapted from reference 48. 


Davson*. If the equivalent perceived steady-state luminance is L’, the dis- 
played luminance as a function of time is L(t), and the frame period is Тү, 
then the Talbot-Plateau law predicts: 


Tf 
U-z- f L(t) d t. (4.12) 
f 
O 


That is, the apparent luminance equals the time-averaged luminance when 
the stimulus rate exceeds the CFF. According to Luxenberg and Kuehn, this 
law has never been found to be in error by more than 0.3 percent. 

An important effect often overlooked in system design is the dependence 
of the CFF on the location in the visual field of the flickering stimulus. It 
has been observed that the CFF increases as the extrafoveal (off-axis) angle 
increases. Thus peripheral vision is more sensitive to flicker than foveal 
vision. It is not uncommon to see a system where the magnification has been 
chosen for eye-system resolution matching, independent of flicker consider- 
ations, in which flicker at the field edge is extremely annoying. The moral 
here is to run a large display at a high frame rate, or alternatively, to keep a 
flickering display small. An effect related to flicker perception is the phi 
phenomenon, wherein scene motion when viewed by a framing sensor ap- 
pears to be smooth so long as flicker is not perceptible. 
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4.7.2 Interlace 


Early in television research, Engstrom*® showed that flicker at a given 
frame rate may be eliminated by a technique called two-to-one interlacing. 
In this scheme, a frame is divided into two sequential image components 
called fields, each of which is presented at a field rate of twice the frame 
rate. The first field contains only alternate raster lines, and the second field 
contains the remaining lines, as shown in Figure 4.13. 
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FIRST FIELD SECOND FIELD FULL FRAME 


Figure 4.13 Interlace. 


If a uniform area of the display is viewed at a distance such that the 
raster lines are unresolvable, the eye fuses the two frames to give a visual im- 
pression of spatial uniformity in that area. This technique doubles the effec- 
tive frame rate so that it exceeds the CFF. However, if the lines are resolvable, 
that uniform area will appear to “shimmer” or *''scintillate" or “vibrate 
vertically", depending on the viewer’s perception. This effect is due to the 
perception of the half-frame-period phase difference between adjacent lines, 
and is called interline flicker. Engstrom demonstrated experimentally this 
correlation of interline flicker with line-resolvability which is the cause of a 
flicker and resolution trade-off problem. Engstrom also reported that 
blinking, rapid eye motion, or rapid head motion may cause the observer to 
perceive only one field during the change. This is a very annoying effect 
which becomes more pronounced as the interlace factor is increased. The sub- 
jective impression is that the picture is “banded”. 
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If for some reason such as bandwidth conservation or TV-compatibility 
it is necessary to eliminate broad area flicker by interlacing rather than by in- 
creasing frame rate, a decision must be made. One can either magnify the 
image so that the full resolution capability of the system is used, and toler- 
ate the interline flicker, or one can demagnify the image until interline 
flicker disappears, and sacrifice resolution. Another effect was noted by 
Schade®, who observed that a finely-spaced, but pronounced, raster signi- 
ficantly reduces both noise visibility and scene detail visibility. When Schade 
eliminated the raster structure either by defocussing or by increasing the 
viewing distance, he found that the apparent noise level increased markedly 
but that details were more visible. 

An excellent summary of raster defects in imagery presented at 
30 frames and 60 fields per second with 2:1 interlace is given by Вгомп“?. 
He noted that if all lines but one are optically masked, the unmasked line 
appears stationary and non-flickering and that if all but two adjacent lines are 
masked, the two lines appear to oscillate. When a complete picture is shown 
this interline flicker is perceived only in small areas of uniform luminance, 
and interline flicker is most noticeable at the boundaries of these areas. 

Brown also described the phenomenon of line crawling which accom- 
panies interlace, wherein the lines appear to crawl either up or down 
depending on which direction the individual’s eye tends to track. Line 
crawling is observed when interline flicker is perceptible, and becomes more 
noticeable as either the picture luminance or the line center-to-center angular 
spacing increases. Other investigators have found that line crawling becomes 
pronounced to the point of extreme objectionability as the interlace ratio 
increases. 

Brown also investigated the subjective effects of reducing video band- 
width in a TV system by introducing a 2:1 interlace, while keeping the 
frame rate constant. He compared noninterlaced pictures with from 189 to 
135 lines per picture height in increments of (2)!/9 with a 225 line inter- 
laced picture to discover illuminance effects. He also compared noninter- 
laced 225 to 135 line pictures in (2)!/4 increments with a 225 line interlaced 
picture for effects of noise and related parameters. 

His most interesting conclusion was that the subjective effect of interline 
flicker to the observer is that of noise. This observation can easily be verified 
with a TV by looking closely at high signal-to-noise ratio images and noticing 
the apparent presence of vertical noise. This applies to highlight luminance 
values of from 40 to 100 fL. 

It has often proved desirable from cost considerations to interlace by 
more than two to one, but the degrading effects are not well known. 
Inderhees?? investigated flicker thresholds using an oscilloscope with a P1 
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phosphor to generate a rectangular 12 line raster with interlaces of 2:1, 3:1, 
and 4:1 (with scan patterns of 1-2-3-4 and 1-3-2-4). He used frame rates of 
20, 25, 30, and 40 frames per second, and determined the interline flicker 
threshold as a function of display luminance and interlace pattern, as shown 
in Figure 4.14. Inderhees also observed the “line crawl" or “waterfall” 
effect noted by television researchers which occurs when the interlace is 
sequential in one direction. The eye then tends to perceive each line sep- 
arately, resulting in an apparent rolling motion of the raster. 
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Figure 4.14 Threshold of interline flicker as a function of interlace and 
frame rate, adapted from reference 50. 


4.7.3 Edge Flicker 


Many thermal imaging system users have observed another framing effect 
related to simple flicker which occurs near the scan edges of a high duty-cycle 
bidirectional scanner. Points near either edge of the display are illuminated in 
rapid succession by the forward and reverse scans, and if the separation in 
time is small enough, those two events may be interpreted as a single event 
by the brain. 

Consider the simple case of a rectilinear bidirectional scanner with scan 
dead time 6 seconds per frame, frame period Те, and display width W. Assume 
the dead time is equally divided on the right and left hand sides of the scan, 
and consider the time history of the luminance at the arbitrary display point 
t shown in Figure 4.15, where points (1) and (2) represent the display (but 
not the scan) extremes. 
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Figure 4.15 Notation for edge flicker. 


Figure 4.16 shows the points in time at which (1), (2), and £ are illumi- 
nated. If the duty cycle is high and the dead time low, (2) will be illuminated 
twice in rapid succession, and the eye may interpret these two closely-spaced 
events as a single event. The same effect occurs at (1) at the other display 
edge, and may also occur at the arbitrary point £ if it is close enough to the 
display edge. 

If two successive illuminations at £ are fused, the reader may easily 
verify that the apparent frame rate F' will be lower than the time frame rate 
and is given at a point £ by: 


ВЕ -EW ) (4.13) 
? РАЈЛИ 
2 


Every point ¢ will not necessarily appear to be presented at a rate below 
the actual rate, but only those points near the edge. For example, for £ = W/2 
(ie., at either edge), F’ = F/2. This edge flicker effect is not well enough 
understood to enable prediction of its occurrence, but it is another factor 
which militates for higher frame rates and unidirectional scans. 


4.8 Lines per Picture Height and Preferred Viewing Distance 


Engstrom*® simulated 24 frame-per-second noninterlaced television 
with 60, 120, 180, 240, and infinite lines per picture height. Using display 
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heights of 6, 12, and 24 inches and display illuminations of 5 to 6 foot 
candles, Engstrom determined the viewing distance preferred by viewers. He 
found that viewers tended to select the distance at which the raster is just 
noticeable, and that at closer distances viewers objected to the raster. The 
results are shown in Figures 4.17 and 4.18. 
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Figure 4.17 Visibility of scan lines as a function of picture height 
and viewing distance, adapted from reference 48. 


The curve of Figure 4.18 shows that viewers spontaneously elect to 
view a display from a distance such that the scan lines are on 2 arcminute 
(0.58 mrad) centers. Engstrom also found that if rastered and nonrastered 
images were viewed at a distance such that both appeared to be identical, 
that distance exceeded the preferred viewing distance for the rastered image 
by approximately 50 percent. 

In a later experiment, Thompson" ^ ascertained preferred viewing dis- 
tances for rastered television imagery and for imagery with the raster sup- 
pressed by spot wobble. He found that for 480 active scan lines displayed at 
60 fields and 30 frames per second, the average preferred distance was such 
that the rastered picture subtended 7.6 degrees vertically and the non- 
rastered picture subtended 13.1 degrees vertically. This corresponds to ap- 
proximately 0.28 mrad per line with raster and to 0.48 mrad per line without. 
The former number conflicts with the results of Engstrom, but the disagree- 
ment may be due to the larger number of experimental subjects used by 
Thompson. In this regard, the distribution with subjects of the Thompson 
data shown in Figure 4.19 is most revealing. One conclusion which may be 
drawn is that a system having a fixed magnification (or viewing distance with 
a monitor) will please only a small proportion of its viewers. 


51 


142 


SCANNING LINES PER INCH 


PICTURE HEIGHTS 


e 6” 
О 12" 
x 24" 


2 ARCMINUTE LINE SUBTENSE 


2 4 6 8 10 12 14 16 18 20 22 24 
VIEWING DISTANCE IN FEET 


Chapter 4 


Figure 4.18 Resolvable number of scan lines per inch as a function of viewing 
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Figure 4.19 Number of observers who choose to sit closer than a particular distance 


for spot-wobbled and conventional pictures, adapted from reference 51. 
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For a particular class of scenes, it is obvious that reducing the lines per 
picture height indefinitely will eventually result in useless imagery. For the 
purposes of broadcast television, Engstrom concluded from his studies that 
story line and action could be followed satisfactorily when 240 lines are 
available. A line rate of 180 was marginal, 120 lines were barely acceptable, 
and 60 lines were totally inadequate. In most FLIR applications, there is no 
story line as such, but problems such as navigation and battlefield surveillance 
would seem to come close enough to the conditions considered by Engstrom 
so that his findings are significant to FLIR. 


4.9 Surround Factor 


A common experience among users of electro-optical imaging devices is 
that theoretically predicted performance is not achieved under field use con- 
ditions. A frequent cause of this performance loss is the strong influence by 
the luminance of the environment of the observer on information extraction 
from the display. The eye adapts to the average luminance to which it is ex- 
posed, so an attempt to get good eye performance by using a bright display 
will failif the display surroundings are dark, because the eye will adapt to the 
average (lower) level where it doesn’t perform as well. 

Luxenberg and Kuehn*? have summarized this problem by defining a 
surround factor F, as the ratio of the luminance of the surround to the 
luminance of the working surface. For example, the surround might be the 
interior of a cockpit while the working surface is the face of an electro- 
optical display. They assert that the surround factor influences visual per- 
formance in the following ways: 


1. А surround factor of less than 1.0 but greater than approximately 
0.1 gives optimum vision. 

2. Ап F, of less than 0.1 gives poorer vision than condition (1). 

3. AnF,ofgreater than 1.0 gives the worst vision. 


The importance of surround factor has been demonstrated conclusively 
by Kiya?? in a thermal imaging application. In this particular instance it was 
found that the target detection and recognition performance of observers 
using a FLIR was considerably improved when the FLIR display was sur- 
rounded by a uniform surface having approximately the same luminance as 
the display. This observation is important because it is typical in FLIR 
practice to have an observer view the imagery through a light-tight eyepiece 
or against a darkened background. Thus a critical consideration in specifying 
the luminance range of a thermal imaging system is knowledge of the ambient 
light level of the display environment so that an effective surround factor 
may be obtained. 
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4.10 Detection of Targets in Random Noise 


The detection of simple targets embedded in random noise is a well 
understood visual process, and the target-masking effects of two basic types 
of noise have been investigated. These are the signal-independent additive 
gaussian noise typical of high background photon detectors, and the signal- 
dependent multiplicative poisson noise inherent in optical signals and typical 
of low background detectors. The discipline called fluctuation theory 
attempts to understand how the visual system detects optical signals 
embedded in gaussian or poisson noise, and to predict the results of detection 
experiments. Legault? gives a historical account of the development of 
fluctuation theory and summarizes the major contributions. The following 
sections treat the two types of noise separately, summarize the results of the 
critical experiments, and show how the results are interrelated. 


4.10.1 Detection in Signal-Independent Noise 


The probability of detection of various simple targets masked by ad- 
ditive white gaussian noise has been investigated by Coltman?^, Coltman 
and Anderson??, Schade?5, and Rosell and Willson??. Coltman and 
Anderson generated square-waves which filled the screen of a television 
monitor, masked them by white gaussian noise, and determined detection 
thresholds as functions of spatial frequency. Schade determined the de- 
tectability of tribar targets in noisy still photographs. Rosell and Willson 
found the detection thresholds of rectangles embedded in noise on live tele- 
vision screens. Some of their experimental results are shown in Figure 4.20 
for a particular eye time constant Tę; time information update rate F; target 
displayed area А,; display noise correlation area A, (nominally one resolution 
element); noise voltage spectrum g(f); and monitor MTF Tn. All of these ex- 
periments demonstrated conclusively that the probability of detection is a 
strong function of the signal-to-noise ratio when all other image quality 
parameters are held constant. 

Coltman and Anderson, Schade, and Rosell and Willson generalized 
this relation to a broad class of conditions by applying the concepts of 
spatial and temporal integration. The effect of spatial integration is accounted 
for by assuming that the eye improves the image SNR by a factor (A, [ A)! [2 
and the improvement due to temporal integration is accounted for by a factor 
(T, E)! / 2. Then we may define* a perceived signal-to-noise ratio SNR, by: 


SNR, = SNR; (Т.Е АЈА)! > (4.14) 


*Rosell’s symbol for this quantity is "displayed" signal-to-noise ratio SNRp. That symbol is not used 
here because the author feels that that notation suggests SNRp is a quantity existing on the display, 
whereas it is a hypothetical signal-to-noise ratio assumed to be calculated by the visual system. 
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Figure 4.20 Probability of detection versus video signal-to-noise ratio 
for rectangular images, adapted from reference 39. 


where SNR; is the image point signal-to-noise ratio and 


A. -gi2 f g? (f) £,2(f) df.]"!. (4.15) 
O 


Experiments of this type are now described by the universal curve shown in 
Figure 4.21 which relates the probability of detection to Rp- The normalized 
data of Figure 4.20 are shown superimposed on the theoretical curve in 
Figure 4.21. 

This curve is the cumulative probability function of a gaussian proba- 
bility density. Denoting the probability of detection as a function of SNR, 
as "SNR, [Det] қ 


SNR, — Ц 
PSNR, [Det] = || ef" [20* gg. (4.16) 
T .oo 


The Rosell and Willson data yield a standard deviation o of 1 and a 
mean и of 3.2. A simple verbal statement of equation 4.16 is that P[Det] is 
the probability that the signal plus the instantaneous value of noise exceeds 
1.6c. Another interpretation of this cumulative probability function is that 
the probability of correctly interpreting a display disturbance as a signal 
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Figure 4.21 Probability of detection versus SNR, required 
for rectangular images, adapted from reference 39. 


equals the probability that the signal-to-noise ratio is greater than or equal to 
a decision value of 3.2. Various equivalent expressions for this are: 


SNR,-3.2 
1 -& [2 а (4.17) 
P [Det] e 5 
ра f 
1 || e£ /2 а (4.18) 
2T 

3.2-SNRp 

SNR 
__1 е8 —3.2)2 /2 d£. (4.19) 


e - SNRp^/2 ag. (4.20) 
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For SNR; > 1, #15 function can be approximated by a simpler function, 
PSNR, [Det] =] — exp[-B(SNR,-A)’ 1, (4.21) 


where many pairs of parameters A апа В are acceptable, but most simply 
A=1 and B=.15. For analytical purposes there is probably little loss in 
accuracy in using this approximate function, given the deviations of the ex- 
perimental data from the idealized form. 

The hypothesis suggested by this is that the visual system establishes a 
signal-to-noise ratio threshold as a reference to test for the significance of 
neural impulses. The effect is that low SNR optical signals are not detectable, 
but also that low level noise events are not mistaken for signals. Thus, we 
are not constantly mentally disturbed by fleeting impressions (false alarms) 
of objects which do not exist. 

It must be noted, however, that the Rosell and Willson data represent 
the composite behavior of different observers. The change in the SNR be- 
tween a low probability of detection condition and a high probability of 
detection condition for a particular individual may be smaller than the com- 
posite data would suggest, and the gaussian group behavior could be due to 
differences in individual thresholds and to changes in observer condition 
with time. The reader with access to a thermal imaging system may test his 
own thresholds by using a controllable thermal source to produce various 
SNR’s. 


4.10.2 Detection in Signal-Dependent Noise 


A source of light which emits N photons/second exhibits a poisson dis- 
tribution of emission rate with a mean value N and a standard deviation УМ . 
Therefore no optical signal is noise free; there is always noise present due to 
the random emission rate of the source. Because the mean signal to rms 
noise ratio is NA/N = УМ, such quantum noise is termed signal-dependent. 
Any optical detection system, including the eye, is ultimately limited by this 
inherent noisiness of optical signals? ! -55,5 6. 

Blackwell has reported numerous experiments which determined the 
eye's capability to extract simple signals from uniform backgrounds where 
the sources are free from other than quantum noise. These classic ex- 
регітепіѕ“ 4 ,° 7-5? determined the limits of the eye’s ability to detect simple 
targets embedded in uniform backgrounds. The targets were circular discs of 
variable size and luminance, and the backgrounds were large (10 to 60 degrees) 
uniformly illuminated fields of variable luminance. The experiments con- 
sisted of varying background (adaptation) luminance Lp, target presentation 
(exposure) time t, and target angular diameter 0. The observers' collective 
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probability of detecting the target 50 percent of the time was determined. 
All of the data are reported in terms of threshold target contrast defined by 
С= (Іт — Lg)/Lg. 

In the first of the 1946 experiments, observers were asked to choose 
among eight possible target positions for a target appearing at a known time. 
The background luminance varied between 1076 and 120 fL, the target angle 
varied between 1.05 and 35.1 mrad, and the exposure time was six seconds. 
The results are graphed in Figure 4.22. Similar results were obtained for 
negative contrasts. In the second of the 1946 experiments, both the posi- 
tion and time of target presentation were known, and exposure was extended 
to not more than 15 seconds to permit the maximum possible temporal 
integration. Target angles were from 0.173 to 105 mrad, and luminance 
varied from 1076 to 120 fL, with the results shown in Figure 4.23. 

In the 1950 to 1952 experiments, observers were asked to distinguish 
the time of target presentation from among 4 intervals for targets of known 
size and position. The luminance varied from 0 to 100 fL, the targets from 
0.233 to 14.9 mrad, and exposures from 10^? to 1 second. In the 1956 to 
1957 experiments, background luminances of 0.1 to 300 fL were used with 
target sizes of 0.29 to 16.98 mrad and exposure times of 1, 0.1, and 0.01 
second. The results of these later tests are similar to the 1946 data. It is 
instructive to plot the 1950-52 data as shown in Figures 4.24 and 4.25. In 
Figure 4.24, the functional dependences of threshold target luminance dif- 
ference on target angle for constant background clearly show that an areal 
summation mechanism is operative up to a size limit of about 10 milliradians. 
Figure 4.25 indicates that temporal integration still has a significant effect 
for exposures as long as 0.5 second. 

Blackwell drew the important conclusion from his 1946 data that the 
probability of detection as a function of contrast has the form of the normal 
ogive given by 


(cc (cem) 


= | e£ /2 dé, (4.22) 


where the standard deviation o has the value 0.5 and Стнр is the measured 
threshold contrast. 

Thus detection of signals embedded in signal dependent noise is des- 
cribable as a random gaussian process with a mechanism similar to that 
Observed for additive noise. Blackwell suggested that the average neural effect 
or excitation E of a stimulus L is a linear function of L. For the visual task 
of detecting a AL in a background Lp, Blackwell surmised that the visual 
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Figure 4.24 Contrast thresholds for one second exposure as a function of 
target size and background luminance, adapted from reference 44. 
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Figure 4.25 Contrast thresholds for a 1.16 mrad. target size as a function 
of exposure time and background luminance, adapted from reference 44. 
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system examines the distribution of the E around the mean Ер corres- 
ponding to a background Lp, and establishes a comparison criterion Ес 
which is proportional to Ep. The criterion E, is then used to test for instan- 
taneous events E to see if they contain significant deviations from Ев. The 
assumption is that every E = Ер + AE is compared to E,, and that the per- 
centage of time E exceeds E, is the probability that a decision will be made 
that a signal AL corresponding to AE is present. Another way of saying this 
is that AE must exceed Е. — Ер before the detection probability exceeds 
50 percent. This is the same mechanism postulated by Schade and by Rosell 
to explain their own experiments. 

In the later paper, Blackwell?^? observed that curves of threshold AL 
as functions of Lp, target angle, and exposure time are fitted well by equa- 
tions of the form 


AL. 42 
Lp =a, ( + 2) , (4.23) 


where a, and а, are complicated functions of exposure and target size. 

The following discussion is a condensation of a derivation by Ory$? 
which offers more insight into the problem of visual detection. We discussed 
above how the experiments of Schade, of Rosell and Willson and of 
Blackwell support the hypothesis that the visual system discriminates be- 
tween the presence or absence of an optical signal masked by gaussian noise 
in accordance with decision theory. Ory hypothesized that the probability of 
detection P[Det] (i.e., of correctly deciding that an optical signal is or is not 
present) is determined in a neural excitation space such that 


co 


| 21,2 
P[Det] = (E-ET)" /20т dE 
и T = /` (4.24) 


(ET-K)/ OT 


BV || eta, 


OO 


where the E's are neural excitations. E or & is the variable of integration, 
ET is the instantaneous total number of neural excitations consisting of 
Eg excitations corresponding to the background, and AE corresponding to 
the signal, Ет = Eg + AE. The standard deviation of the variations (noise) 
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around Ет is от. This is the probability that the instantaneous value of total 
neural excitations Ет exceeds a threshold value K. The meaning of K is that 
the eye sets a threshold which if exceeded indicates the significance or signal 
nature of the excitations. This is a straightforward extension of the Schade 
and Blackwell hypotheses, with the exception that Ory postulated that the 
significance criterion depends on the illumination level in a more complex 
way which permits the eye to function over six orders of magnitude in 
luminance. A gaussian noise is used for all luminance levels by assuming that 
the photon flux is large enough to approximate the poisson-distributed 
photon noise with a gaussian distribution. Thus, both the quantum noise 
inherent in light and system-induced electronic noise may be treated as 
gaussian. For the purely quantum noise case, 


op = УВт = //Ев + AE. (4.25) 


Ory assumes the threshold excitation to have the form K = Ep + C, where 
C depends on the noise and is given by 


С=х + yo, 2047 x + yo, + z (Ep + AE). (4.26) 


Written this way, the upper limit of the second integral in equation 4.24 
becomes 

Т-К. ДЕС. (4.27) 

OT OT 

Now the first term in C in equation 4.26 indicates an absolute lower limit 
on the detectable signal. The second term indicates a threshold in the de- 
tectable signal component which proportional to the noise, so that detection 
is dependent on achievement of a specific signal to noise ratio. The third 
term indicates a contrast dependent threshold. Thus Ory’s model predicts 
the three cases familiar from the literature: 


|. An absolute lower threshold limit on signal detection; 

2. Signal detection at moderate light levels which is a function of 
signal-to-noise ratio; 

3. Contrast limited detection at high light levels. 


Other analyses related to Blackwell’s research and data are presented by 
Bailey®! , Overington and Lavin? , and Kornfeld and Lawson??. They derive 
expressions which fit the Blackwell data and which interrelate threshold 
target luminance, the target solid angle, the viewing time, and the background 
luminance. 
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Blackwell’s original tests? 7 were conducted under idealized laboratory 
conditions, so the results are not necessarily applicable to practical contrast- 
limited target detection tasks. Blackwell reported results for practical tasks 
in a later paper^^, and DeVos, et al.5^ conducted experiments similar to 
Blackwell's original tests, but under less artificial conditions. They reported 
upper and lower limiting thresholds as compared to Blackwell's 50 percent 
threshold to indicate the contrasts required to perform practical tasks. 

The breakpoints in fluctuation theory were loosely delineated by 
Rose? ©, who observed that quantum noise variations tend to become un- 
noticeable in the neighborhood of 10? fL. At that luminance and above, a 
white surface appears smooth and noiseless. Rose asserted that statistical 
fluctuations are significant from 1076 to 10? fL, but are relatively unimpor- 
tant above 10? fL in the contrast limited regime. 


4.11 The Subjective Impression of Image Sharpness and its 
Objective Correlates 


The earliest measure of image sharpness was the two-point Rayleigh 
resolution criterion$? , the angular separation between two point sources at 
which the two points are just distinguishable under a specified set of condi- 
tions. Depending on the spread function shape, the two-point resolution sep- 
aration can be the same as another early measure of resolution, the angular 
separation of the 50 percent intensity points of the impulse response. A 
somewhat more sophisticated definition of resolution which is related to the 
ОТЕ is the “resolving power" or "limiting resolution". Limiting resolution 
is the maximum spatial frequency of an extended sine wave or square wave 
chart which can be resolved visually using a system under a specified set of 
conditions. 

A widely used objective correlate of sharpness for photographic sys- 
tems is “‘acutance’’, which under some circumstances correlates well with 
Observer estimates of image sharpness. Acutance is defined^? in terms of 
the response in photographic density units D to an object consisting of a step 
function. If the edge response of the photographic system is D(x), acutance 
A is defined as shown in Figure 4.26 by 


A=G,’/D’, (4.28) 


N 
«Жалы» үт ) (4.29) 
1 
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Figure 4.26 An edge response showing the notation 
for the definition of acutance. 


Acutance is therefore a measure of the steepness of the edge response. 
Roetling, et al.6°, have shown that acutance is proportional to Ne, so that 
two seemingly different descriptions of resolution are actually equivalent. 
Higgens®’ has demonstrated the differences between acutance and resolving 
power by generating imagery of bar charts and squares with high resolving 
power and low acutance in one case, and low resolving power and high 
acutance in another. 

We introduced the concept of the optical transfer function (OTF) in 
Chapter Three. The correlation of image quality with the MTF has been in- 
vestigated quite extensively, but the effects of the phase transfer function 
have been relatively neglected. Since the OTF is the Fourier transform of 
the system impulse response, it is evident that in a very general way, the 
narrower the LSF is, the better the MTF. Thus, resolution 15 in a gross sense 
indicative of the MTF. The limiting resolution under ideal conditions is indica- 
tive of the frequency at which the MTF drops to a low value, commonly 
taken to be between 2 percent and 5 percent modulation. 

The MTF can exhibit virtually any kind of behavior within realiza- 
bility constraints, but there are no experimental data which suggest that 
image quality is related to the shape of the MTF curve ina precise manner. 
However, experience yields some general guidelines concerning the types of 
MTF’s shown in Figure 4.27. Schade®® asserts that the approximately 
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gaussian MTF of curve A is the most desirable for television-type imagery be- 
cause it closely follows the form of the eye MTF and is adjudged to look 
natural. Further, Schade considers major departures from the gaussian form 
to give unnatural looking imagery. The MTF of curve B will cause “ringing” 
at sharp edges which may be objectionable. The MTF of curve C suppresses 
broad background features, emphasizes mid-frequency detail, sharpens the 
edge response, and in some cases may cause undershoot in the wake of small 
high intensity targets. Unusual behavior like that of curve D should be 
avoided because the subjective effects are impossible to assess in advance. 
Hopper? has observed that whenever a system's MTF falls below the curve 
of Figure 4.28, the imagery will be objectionable. The verbal response to the 
imagery will be that it looks **washed out”, it “needs more gain”, or “it needs 
more gray shades." The lesson is that N, is useful chiefly for comparing sys- 
tems whose MTF's have the same functional form. 


(A) (B) (C) (D) 
" МТЕ МТЕ 7 
f f f f 


Figure 4.27 Some possible MTF shapes. 


The most universally accepted objective correlate of sharpness is the 
quantity introduced in Section 3.14 called equivalent line number (N,), 
equivalent signal bandwidth, and other names. Schade"? first introduced N, 
and recognized its significance. In his words, **. . .the integral of the squared 
sinewave response function of a system. . .is a measure indicating correctly 
the effective width of an edge transition, correlating well with the general 
subjective impression of image sharpness. . ." Further, Schade asserted? that 
“А barely detectable difference. . .of sharpness. . .corresponds to a change 
of 3.5 percent in equivalent bandwidth." This conclusion was based on the 
data of Baldwin? ! , and of Schade. In equation form, 


№, = || T." (f) df, (4.30) 
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where T, is the overall system MTF. Note that phase shift effects are not 
included. 

The significance of № has been verified by the experiments of Scott’ 2. 
Scott used four aerial photographs showing a ground area of 530 by 530 feet 
at a photographic scale of 1 to 3000, and degraded them with gaussian spread 
functions and photographic grain, both separately and simultaneously. Photo- 
interpreters then used unaided vision or magnifiers, as they desired, to rank 
the blurred images, the noisy images, and to rank and equate noisy, blurred 
images. Scott parametrized the MTF of each by the characteristic frequency 
f. at which the MTF equals 0.61. This corresponds to a standard deviation 
for the spread function of 0.155/f., and to an М. of 0.21f,. His f,’s ranged 
from 107.7 cycles per picture width to 11.31 cycles per picture width. 

Scott concluded that in grainless photographs the liminal increment in 
f. for a single scene is approximately 5 percent, and that the threshold of f, 
discrimination for grainless photographs of different scenes is about 10 per- 
cent. A second experiment found good correlation between photo-interpreter 
performance with still aerial imagery and the photo-interpreters’ subjective 
rankings of the imagery. Elworth, et al.’*, have verified that №. correlates 
better with interpreter performance than any other resolution measure. The 
N, concept should not be applied without regard for the scene frequency 
content, however. Consider the two MTF's shown in Figure 4.29 where A 
and B have the same N,. B will provide slightly better imagery than A for 
very low frequency objects, but A will be better than B for very high fre- 
quency objects. 

We noted earlier that Schade used the data of Baldwin. Baldwin's 
experiments”! are worth considering in more detail, because he determined 
the discriminability of spread function changes in continuously degraded 
motion picture films. Baldwin produced square spread functions with width 
“а” (corresponding to an Мо of 1/2a) in a 14-degree by 19-degree field 
viewed at 30 inches. The N,'s ranged from 0.175 to 0.75 cy/mrad at the eye, 
corresponding to first zero frequencies of from 0.35 to 1.5 cycles/mrad. 
Baldwin's results are expressed in units of liminal discriminability of the 
resolution change. One liminal unit indicates a 50 percent probability of dis- 
criminability, and approximately 3 to 4 units indicates nearly 100 percent 
discriminability. Baldwin observed the dependence of discriminability on № 
shown in Figure 4.30. 
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Figure 4.28 Minimum acceptable MTF, adapted from reference 73. 
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Figure 4.29 Two MTF's with the same equivalent bandwidths. 
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RELATIVE SUBJECTIVE SHARPNESS IN LIMINAL UNITS 


0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 
f [cy/mrad] 


Figure 4.30 Liminal sharpness versus cutoff frequency for a square 
spread function, adapted from reference 71. 


Interpreting his data, we find a range of delta-N,/N, for one liminal 
increment of 0.03 (at 1/a = .35 cy/mrad) to 0.075 (at 1/a = 1.33 cy/mrad). 
Thus, for a fixed magnification system, it becomes more difficult to notice 
resolution improvement as the baseline resolution increases. That is, at 
1.33 cy/mrad (3/4 mrad system) it would take a 7.5 percent improvement 
to have a 50 percent probability of discriminating the difference, and around 
20 percent improvement to have a positively noticeable difference. Baldwin 
also varied vertical and horizontal resolution using a rectangular spread func- 
tion, and found a slight preference for better resolution horizontally than 
vertically, as shown in Figure 4.31. 


4.12 Application of Visual Psychophysics to Thermal 

Imaging System Design 

Five facts of critical importance were introduced in this chapter. These 
are that the eye: 


€ behaves like an ensemble of stochastically independent displaced 
and overlapped tuned filters which appear to approximate an op- 
timum filter for particular visual tasks; 

integrates a signal extended in space and repeated in time; 

detects and recognizes objects in a probabilistic manner; 

has contrast-limited and noise-limited detection modes; and 

is sensitive to framing and interlacing. 
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Figure 4.31 Liminal sharpness versus resolution element shape, 
adapted from reference 71. 


Each of these five facts has a corresponding action which must be taken 
in system design. These are: 


€ Assure that the system magnification and displayed MTF are ade- 
quate to bring the target spatial frequency spectrum well within 
the eye’s bandpass. 

e Allow for the beneficial SNR-enhancing effects of temporal and 
spatial integration in system design and performance specifications. 

@ Provide SNR sufficient to achieve desired performance on a proba- 
bilistic basis. 

ө Provide an appropriate surround factor and displayed luminance, 
dynamic range, and contrast to permit system-noise-limited 
performance. 

@ Eliminate perception of framing and interlacing. 


When these five actions have been taken, system performance will be 
very close to optimal. It is not necessary to eliminate defects completely 
because the eye is tolerant of minor defects in a television-type picture. It is 
readily verifiable that the perception of noise and shading is reduced when 
picture detail increases, whereas perception of contour sharpness is improved. 
Thus, minor artifacts which would be objectionable in a featureless TV 
picture tend to be unnoticed in a detailed picture. 
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CHAPTER FIVE — PERFORMANCE SUMMARY MEASURES 


5.1 Introduction 


In previous chapters, we discussed such fundamental system properties 
as the optical transfer function, the image signal-to-noise ratio, the frame 
rate, and the magnification. Each of these properties describes only a small 
part of a system’s ability to perform practical tasks. They are useful ways 
to characterize a design, but other measures which combine them in ways 
appropriate to particular tasks are needed in order to conduct performance 
tradeoffs. The ideal such summary performance measure must include the 
observer’s limitations, must be predictable from fundamental system param- 
eters, must be easily measurable, and must relate to the performance of the 
system as it is ultimately intended to be used. In this chapter we will consider 
some commonly used summary measures and attempt to ascertain how they 
are useful and how they may be misleading. 


5.2 Noise Equivalent Temperature Difference 


The oldest and most widely used (and misused) measure of the ability 
of a system to discriminate small signals in noise is the “‘noise-equivalent 
temperature difference" or “‘noise-equivalent differential temperature”. 
Several different symbols for this quantity are in current use, and we shall 
use the most common of these, NETD. Similarly, several different operational 
definitions for NETD exist. The simplest and most commonly used definition 
follows: the NETD is the blackbody target-to-background temperature 
difference іп a standard test pattern which produces a peak-signal to rms-noise 
ratio (SNR) of one at the output of a reference electronic filter when the 
system views the test pattern. 

The intent thus is to find the AT which produces a unity SNR ina 
specified filter under normal conditions when the scanner is operating. The 
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NETD test pattern is shown in Figure 5.1, where the dimension W is several 
times the detector angular subtense to assure good signal response. For 
standardization and comparison purposes, it is desirable to use an external 
measurement filter such that the product of all electronics MTF’s including 
the external filter gives a reference MTF of: 


Tg 7 [1 + (f/fg)? ]" ^, (5.1) 
where 
_ | 
fr up (5.2) 


and 
Tq = detector dwelltime. 


For analytical purposes all electronics MTF's are then attributed to this 
filter. 


BACKGROUND WITH 
UNIFORM APPARENT 
TEMPERATURE Tp К 


TARGET WITH/ 
UNIFORM APPARENT 
TEMPERATURE Tr K > ТЕК 


dc DROOP 


NOISE 
SIGNAL V LEVEL 
VOLTAGE 3 


TIME 


Figure 5.1 The NETD test pattern and the resulting voltage waveform. 
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The rms noise voltage V, is measured using a true rms voltmeter which 
operates during the active scan time. The signal voltage V, typically is deter- 
mined from an oscilloscope trace of the voltage waveform corresponding to 
the target, as shown in Figure 5.1. To achieve good results, the target 
AT = TT - Tg should be at least several times the expected МЕТІ) to assure 
that V,22 V4. The МЕТР is then calculated by: 


NETD - (5.3) 


AT 
МАДА 
5.3 NETD Derivation 


Weihe! , Hawkins? , Hudson?, and Soboleva* present МЕТІ) derivations 
using parameters each of those authors finds useful. The NETD derivation 
which follows is one which is simple and which the author finds useful in 
practice. The NETD is most conveniently derived for the case where the 
electronic processing of the detector signals has a flat frequency response 
within the system bandwidth, and where all electrical bandwidth limitations 
are due to the external measurement filter. This allows system video output 
signal and noise to be related directly to detector parameters. The resultant 
equation is easily modified for more complicated cases by adjusting the 
equivalent noise bandwidth term which appears. 

This derivation makes the following assumptions: 


e The detector responsivity is uniform over the detector’s rectangular 
sensitive area. 

€ The detector D* is independent of other parameters in the NETD 
equation. 

ө — Atmospheric transmission losses between the target and the sensor 
are negligible. 

e The target and background are blackbodies. 

e The detector angular subtense, the target angular subtense, and the 
inverse of the collecting optic focal ratio can be approximated as 
small angles. 

ө The electronic processing introduces no noise. 


T This is assumed so that the NETD equation will be simple to permit simple tradeoffs. If the detector 
is background limited, this assumption may be violated and the equations of Section 5.4 must be 
used. 
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The following quantities appear in the NETD equation: 


a,b = detector dimensions [cm] 
a, = detector angular subtenses [radian] 
Tp = background temperature [°K] 


D*(XA) = specific detectivity as a function of wavelength, evaluated at 
the electrical frequency at which the noise voltage spectrum 
of the detector is normalized to be unity, as shown in 
Figure 5.2 [cm Hz! '? /watt] 


g(f) = normalized electrical noise voltage spectrum of the detector 
[ dimensionless | 

Аб = effective collection area of the infrared optics, including 
obscuration [cm? ] 

T (A) = infrared optical transmission as a function of À [dimensionless] 

f = infrared optics effective focal length [cm] 

Afp = equivalent noise bandwidth [Hz] of the NETD test reference 


filter with the spectrum g(f) as a source: 


Aig = | в? (DFR (D df. (5.4) 
O 


Consider the stylized view of a FLIR scanning the NETD target shown 
in Figure 5.3. This target is assumed to be a diffuse blackbody radiator 
obeying Lambert’s cosine law. The target has a spectral radiant emittance 
W, [watts/cm? um] , and a spectral radiance 


Ny = М/л Е! (5.5) 
cm? um sr 


The optical aperture subtends a solid angle of А/К? measured from the 
target, so that the spectral irradiance on the aperture is Wy Ag /тК? [watts/ 
cm? um]. At any instant in time while the detector is scanning the target, 
the detector receives radiation from a target area of og R? , so the spectral 
irradiant power received by the detector as modified by the lens transmission 
то (А) is: 

У 
ES 


p, --2 20 agp? 
AT R? аВЕ то(А) 


ү 
= Ао of r0) | EE. (5.6) 
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Figure 5.2 Normalization of the noise voltage spectrum. 
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We are interested in differential changes with respect to target temperature, 
so we take 


ӘР) ов A | el (5.7) 
aT = Ao тоб) ap [лек 


The differential signal voltage produced in the detector by a target-to- 
background temperature differential is found by multiplying the above ex- 
pression by the detector responsivity R(A): 


av, 
= Тав Ay то0) RO) А T rs | (5.8) 
Since 
V, D* 
R(A) _ Ма DEQ) , (5.9) 
Jab Afk 


where V, is the detector rms noise voltage produced by Afg, we get 


ӘУ (А) a8 Ag то (A) Му D*A OW) 


5.10 
oT T мађ АҒ oT ( ) 
Integrating over all wavelengths, 
ð V, Ao Vn E: 
= -Hho Уа А рад) то) dà- (5.11) 
Ру: ab А 
Using a small signal approximation, 
AV, Ay Мр 
oP Ao Vn LA ps (A) ToCA) dA. (5.12) 


ATO T/ab Afg 
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Rearranging, we get a differential SNR? of 


Sis- AT ŻA [22 D*Q) To(A) dA. (5.13) 
n пар Де 2 oT 


Since we define МЕТР in terms of unity SNR, let AV,/V, = 1 and 
solve for the resulting delta-T which we call NETD: 


ТА / ab Afr 


МЕТр----------------. (5.14) 
об Ас || 9W) 
oT 

O 


D'Q) ToCA) dA 


Predictions based on this equation will deviate from reality by the degree to 
which the basic assumptions are violated. 

For some systems it is possible to assume that T9 (À) = constant = 7, 
within А, <A € ^, and 7,(A) = 0 elsewhere, so that 


TN ab Afr | 


NETD = (5.15) 
А 
o AoTo f aW, (Tp) , 
—57 PD (A) ал 
1 
If this is not the case, an effective transmission defined by: 
f dW) (Тв) А 
әт To (^)р (А) ал 
То = ------------- (5.16) 
ОМ (ТВ) _, 
J ~ D'O)dÀ 


Ї Alternatively, the integral in this expression may be replaced with the equivalent factor: 


+ || W (Тв) D*Q) 1009 аА. (5.17) 
*O 
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may be used. This is simply a definition and introduces no loss of accuracy, 
while it allows the NETD equation to be simplified. 

A useful form of the NETD equation results from normalizing the 
D*(X) in the integral which appears in equation 5.15 by D*(X,), pulling 
D*(Ap) out of the integral, and defining an effective change in spectral 
radiant emittance with temperature as 


А 


AWA | OW) (Тв) D*Q) ay (5.18) 
АТ > aT р) | 


Ay 


Then the simplest possible NETD equation is 


mV ab Afr 


NETD = . 
AW 
о Ас То D*(Xp) "AT 


(5.19) 


In Section 2.2, we noted that an appropriate approximation in terres- 
trial thermal imaging is: 


9W4 (ТВ) C2 
3T АТ? W) (Tp). (5.20) 


IR 


Making this approximation, 


mV ab Afr 1 


ов Ao To D*(Xp) с, Аҙ Wy (Tp) D*A) 
TE | — dar 
Тв 


NETD = (5.21) 
ж 
X A D (y) 


For the case of photodetectors having theoretical performance such that 


AD*(Ap) 
À 


D*(A) = for А < Àp, 


= 0 for A > Ap, (5.22) 
the bracketed expression in equation 5.21simplifies to 
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C2 
Тр?А 


Ар 
|| WA (Тв) da]. (5.23) 
р X 


This expression is readily evaluated in two steps using a radiation slide rule. 
For Tg = 300°K, the expression becomes: 


016 (Pu iu 
916 [ wW BooK an} (5.24) 
M y 


For à, =8 ит and Ap=11.5 ит, the expression equals 1.48 X 10-4 
20 
[ма ст“ K]. 
For an example МЕТР calculation, consider one possible implementa- 
tion of the example system introduced in Section 1.5. This system has the 
following basic parameters: 


€ Detector lens diameter = 2 cm 

Square detector element dimensions = .005 cm 
Square detector angular subtense = 1.0 mrad 
Frame rate/field rate/interlace = 30/60/2 
Number of active scan lines = 300 

Field of view = 400 mrad by 300 mrad. 


We will consider the case shown in Figure 5.4 of a parallel scan system 
using a 150-element linear gapped array. We will assume that the detectors 
are not background-limited, so that we do not have to consider the complica- 
tions of the various detector cold shielding schemes possible. We will further 
assume that the vertical scan efficiency ny is 0.8 and that the horizontal scan 
efficiency пр is 0.8, giving an overall scan efficiency ns, ОҒ 0.64. The sys- 
tem has a spectral bandpass of 8 to 11.5 jum, detectors having per-element 


1/2 AD*(X 
ПО) s of 2 X 1019 Еа Hz | and реду = M) ог А <А 
watt Ар 


р: 


We will assume that the detector noise is white within the system bandpass, 
and that the effective optical transmission то is 0.8. The dwelltime of the 
system is given by: | 


_1 sc ОЙ (150) (.64) (1) (1) 
ta АВЕ ^ (400 (300) (30) 1896 
= 2.67 X 10? [sec]. 
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The reference bandwidth is: 


d. 


-T 
А i A 2та 


= 29.4 [KHz]. 


The appropriate NETD equation to use is: 


4V ab Afg 


NETD - 


11.5 
ӘУ, р*(А) 
о8 D2 т. D'(,) A ми d 
P Do To p J oT О") ^ 


SCAN DIRECTION 
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Figure 5.4 Scan pattern of the 150 element example system. 


For the type of detector described, the value of the integral is 1.48 X 10-4 
[watt/cm? ^K]. Then the NETD is: 


4V (0.005 [cm]? (2.94 X 10^ [Hz]) 


NETD - 
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5.4 NETD Equation for BLIP Detectors 


A photodetector which exhibits performance at theoretical limits is 
said to operate in the background-limited infrared photodetection (BLIP) 
mode. The NETD equations given earlier are valid for BLIP or non-BLIP de- 
tectors, but those equations are not suitable for performing design tradeoffs 
with BLIP detectors. For example, the equation 


mV ab АҒь 


оВ A, D*A 


NETD = (5.25) 


AW 
p) To AT 


does not demonstrate the effect on D* of changing other system parameters. 
The theoretical limit for D*(A) for a photoconductor is! 3 


1/2 
D*Q) = ah (5%) | (5.26) 


where Па is the quantum efficiency and Ов is the background photon flux 
density [photons/cm? sec] incident on the detector. The Ов is given by 


О, А 
"MES |" egy (5.27) 
O 


where it is assumed that the detector is a Lambertian receiver with an ef- 

fective collecting solid angle of т, and where Qes is the effective angle to 

which the detector cold shield reduces reception of background radiation. 
Substituting for Og in the D* equation gives 


* 2A [т 2 __ Ла - 
р") = === I | ^ г (5.28) 
Оу (Тр) ал 

О 


T Multiply by У? for photovoltaics; the Ain the numerator occurs because D* is defined in units of 
[cm Hz! /watt] rather than more physically meaningful units of [ (ст Hz! 2)/(photon/cm?sec)]. 
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A parameter ПЕТ ТРОА) for unity quantum efficiency and for an effective 
background angle of 7 steradians is defined by 


1 
жж А X 
DBLP (A) “эр, BUE TÉ (5.29) 
n 
О 
Then 
* T |^, 1/2 prx 
D* (A) = | 4 | ng! ^ DEPO, (5.30) 
CS 


Substituting for D*(Ap) in the NETD equation gives 


пуа Afg О, 


* AW (5.31) 
«В Ao то та 1/2 DBLP Ap) AT 


NETD = 


Defining а cold shielding efficiency пе, in terms of the actual cold shield 
solid angle Q, used and a theoretically perfect cold shield solid angle Ор, 


А ЈА 
= 5.32 
Mes Су. (5.32) 
the NETD becomes 
qi? vV абд ар 
NETD = — (5.33) 


"T AW ` 
«В Ao To е5 па Овир (Ap) АТ 


The perfect cold shield is defined as one which limits reception of back- 
ground radiation to the cone defined by the optical system F/#. Thus 
for a circular aperture* 


*This expression is approximately correct for F/# 21.5. Otherwise the exact expression 5% = T sin? 
[(arctan (1/2 F/#)] must be used. 
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Q, = Do (5.34) 
and 
NETD ee 2 (5.35) 
МВО, то nes ni DBLP Op AT | 
Applying the same definitions to D* gives 
D* (A) = 2 F/# (попа)! ^ DBLP OU. (5.36) 


Both of these equations demonstrate the limitations of design tradeoffs 
with BLIP detectors. 


5.5 Deficiencies of NETD as a Summary Measure 
The deficiencies of NETD as a performance measure are numerous: 


ө Transients often appear on the video which are blanked on the dis- 
play, but which will cause erroneously large noise readings on the 
rms voltmeter unless the voltmeter also blanks the transients. 

@ The peak voltage level У, (shown in Figure 5.1) may be a matter of 
interpretation. 

€ МЕТР measured at the electronics is not always indicative of the 
overall system performance because there are other noise sources 
and spatial filters intervening between the point of NETD measure- 
ment and the final image. 

ө Тһе МЕТР is a measure of the total in-band noise, whereas the 
eye does not treat all types and frequencies of noise alike. 

e Designing for а low measured МЕТРО without considering the ef- 
fects on image quality can be disastrous for overall system per- 
formance. For example, it is often possible to increase the 
laboratory-measured МЕТР” by using a wide spectral bandpass, at 
the expense of poor field performance. One possibility is that the 
wide spectral bandpass could make the system susceptible to sun 
glints. Another is that if the detectors are BLIP, the high SNR's 
seen in the laboratory will be lost in the field when the signal 
response at the band extremes is attenuated by the atmosphere, 
leaving only noise-producing background. 
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e МЕТРО does not account for the effects of emissivity differences 
on SNR. 

e МЕТО is almost useless for comparison of systems operating in dif- 
ferent spectral regions because of atmospheric transmission 
differences. 


In addition, the NETD criterion does not take into account the fol- 
lowing facts: 


ФӨ The visual system acts like a low-pass filter which rejects high 
spatial frequency noise. 

ө The visual system consists of many parallel narrowband filters 
which are activated separately. Thus a target with a narrow fre- 
quency spectrum may excite only one filter, so that noise rejec- 
tion outside that filter’s band is achieved. 


Therefore one must be very careful in designing a system to be sure 
that one does not excessively filter noise to achieve a good NETD at the 
expense of a good MTF. NETD is best used as a measure of the day-to-day 
performance of a single system and as a diagnostic test. Of course, NETD 
does indicate gross sensitivity. While the difference between two otherwise 
identical systems with NETD’s of 0.2°C and 0.25°C is insignificant, the 
difference between a 0.2°C and a 0.4°C NETD is significant. 


5.6 NETD Trade-offs 


Quite often one would like to know the effects on sensitivity of 
changing a single parameter. For that reason, it is useful to recast the NETD 
equation into a form using fundamental system parameters. The equation 


T V ab Afr 
AW 


——————— (5.37) 
оВА ОУ) To AT 


NETD = 


is usually cast into forms which more specifically represent the system to be 
described. A common case is that of a parallel scanning sensor with an 
n-element non-BLIP detector array which exhibits white noise. 

For a clear circular unobscured collecting aperture of diameter Do, 
and a linear scan having no overscan and producing frame rate F, field-of- 
view А by B, and a combined vertical and horizontal scan efficiency of Ngc, 


Ao = т02/4 (5.38) 
а 
а 29270. (5.39) 


ABF 
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and 
-Tl 
Afp = 2 Ir ` (5.40) 
Then 
2V rab AB F/n q 
NETD »————————— — — . (5.41) 


AW 
(ов) 00 D* (Ap) To AT 


This equation allows o to be varied explicitly and independently for con- 
stant ab, while f is varied implicitly. 
As af = ab/f? , another version is 


2V r AB F/n та f? 


ab D D* (Ap) то 2% 


NETD = (5.42) 


This allows f to be varied explicitly for constant ab while оф is varied 
implicitly. 
As ab = of) f? , we have 


Мт AB F/n ту f 


AW | 
ag Do? D* (Ap) To AT 


NETD = (5.43) 


Неге, of is varied explicitly with constant f, and ab is varied implicitly. 
Finally, for rectilinear non-overlapped but interlaced azimuth scanning 
With a vertical array, B = nl, where I is the interlace factor, and 


2VTabAFI 
NETD = -2vTapAPIB mo ___ (5.44) 


(a)? Do? D* (Ap) ro A 


showing the square root dependence of sensitivity on interlace. 

A common application of any such equation involves deciding which 
spectral region to use for a particular problem. For example, if aperture 
size is fixed due to some vehicular or material constraint, and field of view 
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and frame rate are fixed, the other parameters can be adjusted to give the 
most cost effective or least complex problem solution. A trivial example is 
the aperture diameter comparison for two spectral regions 1 and 2, say 
3 to 5 um and 8 to 14 um, with all system parameters except D*(A) and rg 
being equal, and for equal NETD’s. In that case, 


——— = (5.45) 


NETD is a good measure of sensor performance, and a good sensitivity 
diagnostic test. However, it is not a good image quality summary measure. 
The following section discusses and derives a more useful parameter. 


5.7 Minimum Resolvable Temperature Difference 


We found in Chapter Four that four factors heavily influence sub- 
jective image quality: sharpness, graininess, contrast rendition, and inter- 
ference by artifacts. An objective correlate exists for each of these factors 
which describes its effects when the other three factors are held constant or 
are not significant. A problem arises in that the four factors are usually 
simultaneously influential, and since the equivalences of degradations in the 
correlates are not yet known in detail, it is difficult to predict system per- 
formance for a multiplicity of tasks. 

This is a common problem in electro-optical devices, but it is accentu- 
ated in thermal systems because thermal image contrasts usually don’t look 
quite like the contrasts of a visible scene, so a mental set of standards such 
as those we apply to commercial TV is difficult to acquire. We are very sen- 
sitive to changes in home television reproduction and in newspaper halftone 
photographs, but it is possible to overlook relatively serious degradations in 
FLIR image quality because of the somewhat unnatural appearance of even 
good imagery and the wide variations in system design which are customary. 

It is thus desirable to have a summary measure which includes all four 
elements of image quality, that is, one which is a unified system-observer per- 
formance criterion. There are many possible approaches to such a per- 
formance criterion based on observer threshold perceptual limitations caused 
by random noise, clutter, contrast, spatial frequency content, or observation 
time. In FLIRs, spatial resolution and thermal sensitivity dominate per- 
formance. This section concentrates on a widely accepted measure for the 
signal-to-noise ratio limited thermal sensitivity of a system as a function of 
spatial frequency. 
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Schade? derived such a measure for photographic, motion picture, and 
television systems. It is the image signal-to-noise ratio required for an obser- 
ver to resolve a three-bar standard Air Force chart that is masked by noise. 
R. Genoud® and R.L. Sendall recognized that Schade's formulation was 
applicable with only minor modifications to thermal imagery, and derived an 
expression for the bar pattern temperature difference which will produce 
Schade's SNR threshold for resolution. This functiun is now generally called 
the minimum resolvable temperature difference (MRTD), and is defined for 
four-bar rather than for three-bar targets. The noise-limited case is signi- 
ficant because an infrared imaging system exhibits its ultimate sensitivity 
when noise is visible to the observer, as it will be when the system gain is set 
high to compensate for adverse environmental or scene conditions. 

The following derivation uses the same assumptions as the preceding 
NETD derivation, supplemented by the following: 


e  Theeffect of temporal integration in the FLIR observer's eye/brain 
system is approximated by a fixed,integration time of 0.2 second. 
It is assumed that the eye adds signals linearly and takes the root- 
mean-square value of noise within any 0.2 second interval, although 
this is clearly not the exact mechanism involved, but merely a 
convenient fiction. 

e Theeffect of narrowband spatial filtering in the eye in the presence 
of a periodic square bar target of frequency іт. is approximated by 
a postulated matched filter for a single bar. This assumed eye 
filter is 


sinc =) = sin (rf/2f£q)/(rf/2fT) . (5.46) 


Schade and others have demonstrated that the eye/brain functions 
approximately as a matched filter (or in this case as a synchronous 
spatial integrator) over small areas and therefore responds to the 
two-dimensional spatial integral of the displayed irradiance pattern 
of the target. For simplicity, we will assume that the bars are long 
enough so that bar end effects may be neglected. 

@ The electronic processing and monitor are assumed to be noiseless. 

ө The system is relatively simple with zero overscan, a well-behaved 
MTF, and a well-behaved noise power spectrum g? (f). 

Ф Тһе system is operated linearly so that the response to the target 
is describable by the MTF. 

ө The image formation is assumed to be spatially invariant (sample- 
free) in the scan direction. 
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An operational definition of MRTD is the following. Consider that the 
system views the largest target shown in Figure 5.5, where the bars and 
spaces of the target are blackbody radiators of different but uniform tem- 
peratures, and the bar height is seven times the bar width. Let the system be 
adjusted so that noise is clearly visible on the display, and let the initial tem- 
perature difference between the bars and spaces be zero. Then increase the 
temperature difference until the bars can be confidently resolved by an ob- 
server viewing the display. The delta-T is the MRTD evaluated at the 
fundamental spatial frequency of the chart. Repetition at successively higher 
frequencies yields ап МЕТІ curve of the form shown in Figure 5.6. The steps 
in arriving at the MRTD at a given frequency are the detection of the test 
pattern as a square, followed by the accurate establishment of the delta-T at 
which the operator has a high confidence that he is in fact resolving the in- 
dividual bars. 


fr =f' fT = 2f fT = 4f fT = 8f" 


Figure 5.5 MRTD targets. 


5.8 MRTD Derivation 


It is desirable to preserve historical continuity and to minimize ana- 
lytical complexity by expressing the MRTD in terms of the NETD. The 
design tradeoffs become apparent, however, only when the parametric ex- 
pression for NETD is substituted into the MRTD equation. The following 
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MRT [°C] 


SPATIAL FREQUENCY [cy/mrad] 


Figure 5.6 A typical shape for an MRTD curve. 


quantities in addition to those defined for the NETD derivation are re- 
quired for the MRTD: 


те(ђ = electronic amplifier and video processor MTF 
та (f) = video monitor MTF 


t.(f) = overall system MTF 


T(P) = overall system square wave response 

Te = effective eye integration time [second] 
F = frame rate [sec'! | 

fT = fundamental target frequency [cy/mrad] 
f'T = fundamental target frequency [Hertz] 


We will derive the MRTD by considering first the effects of the system 
on the SNR in the target image, and then including the effects of the obser- 
ver. At the output of the monitor, the SNR in the image of one bar is 


А. (5.47) 


1/2 


Rr oo 
|| 22 (DT, (Тр (f) df 
O 


Afp 
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where the square wave response factor, describes the modulation loss in the 
bar image, the factor АТ/МЕТОр gives the electronic SNR measured at an 
NETD reference filter for a large target with a particular AT, and the 
bracketed term in the denominator converts from the reference bandwidth to 
the actual system bandwidth. Since 4/7 is the first harmonic amplitude of a 
unit square wave, for spatial frequencies greater than or equal to one-third 
of the nominal cutoff, 


%-4%//т. (5.48) 


For а large class of systems, this is a good approximation for frequencies 
greater than one-fifth of the cutoff frequency. 

When an observer views the target, he perceives a signal-to-noise ratio 
modified by four perception factors: 


l.  Forthis type of target, the eye apparently extracts and operates on 
mean signal in the image, as demonstrated by Schade?. For fre- 
quencies greater than one-third of cutoff, the degradation of mean 
signal level from the peak signal input is given by 27 /т or 8г;/т?, 
because 2/7 is the average value of the first harmonic of a unit 
square wave over one-half cycle. 

2. SNRisimproved by the square root of the number of independent 
time samples per display point per 0.2 second, that is, by 
(Т.Б)! ? due to temporal integration. 

3. In the vertical bar direction, the eye spatially integrates signaland 
takes the rms value of noise along the bar, using as a noise 
correlation length the dimension f, giving a perceived SNR im- 
provement of: 


(bar length/8)!2 = (7 X Баг width/8)!? = (7/2 Ту 0)“ 2 (5.49) 
4. In the horizontal direction, the effect of eye integration is most 


easily accounted for in the frequency domain by replacing the 
displayed noise bandwidth 


оо 
| g? T T df 
О 
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with a noise bandwidth accounting for the eye’s matched filter action, 


со 


|| g? T,? Ty” sinc? (f/2 fr’) ағ. 


© 


This gives a perceived SNR improvement of: 


oo 
2902592 1/2 
IE re^ Ip, df 
Ó 


| g? T.? Tr? sinc? (f/2 fr’) df 
O 


Combining these four effects with the image SNR}, the perceived SNR is 


SNR. 2— Т.---------------- [TF] 
Р л? 5 ро 1/2 е fT 
| err, df 


1/2 


| g? Te? T4? df 
0 


| тоа sinc? (6726: а 
О 


1.51 AT [T,F]!r, 


oo 1/2 
NETDg J 5 Te Tm sinc? (f/2fr') df [ств]! 
° Afg 


(5.50) 
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Letting SNR, equal a signal-to-noise ratio k in the image of one bar 
necessary for the observer to detect that bar, solving for AT, and defining a 
bandwidth ratio p by 


co 1/2 
2~2 ~ 2 . 2 ГА 
Ц g^ Те“ T° sinc wet a | (5.51) 
Afr 
we get 


k 
NETDp pi” [#78] 1/2 Т5] 


Ao туз (5.52) 


Here it must be noted that target frequency appears as fy in pl [2 with units 
of [Hertz] and as fp with units [cy/mrad] elsewhere. 

The AT which is chosen as the MRTD is associated with a perceived 
signal-to-noise ratio which is in turn associated with a probability of detec- 
tion. Blackwell found that the threshold of observer confidence is approxi- 
mately the same as the 90 percent accuracy forced-choice detection threshold. 
Therefore it seems reasonable to select the constant in the MRTD equation to 
be consistent with a 90 percent probability of individual bar detection. 
From Section 4.9.1 we have that the k for Ра = 0.9 is approximately 4.5. 
Letting k = 4.5 and AT = MRTD, we have that 


3 NETDR о’? (fp8)!? 


= : (5.53) 
E(T,F)!^ 


МЕТР (fy) = 


The author’s experience has been that this equation adequately predicts the 
measured MRTD of a detector-noise-limited, artifact-free system if all param- 
eters are accurately known. 

There are four serious problems with the MRTD as formulated here which 
make it less than perfect as a performance summary measure. The first is that 
it does not include the MTF of the eye and magnification effects. The second 
is that it is a human response measure, and some people are uncomfortable 
with a man explicitly in the system evaluation process. The third is that the 
spectral sensitivity is not explicit in the formula. The fourth is that MRTD 
does not unequivocally correlate with field performance. Thus it is most use- 
ful as a diagnostic tool and as a gross indication, per Section 10.5, that field 
performance will be as desired. 
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The MRTD concept has been evaluated in the laboratory with many dif- 
ferent systems by many different individuals. The correlation with theory 
has been good, the test is easily performed, and it is repeatable. Further- 
more, it is widely accepted because it is an easily grasped, clearly observable 
concept. The MRTD equation also demonstrates an important point about 
frame rate and flicker. In the case of a linear scan, the conversion factor from 
spatial to electrical frequencies along a scan line is proportional to the frame 
rate. Then Afp is proportional to F, NEIDmg is proportional to F!/ 2, and 
the (F)!/2 factor in the denominator removes the apparent F dependence 
of MRTD. This is a significant point because it shows that increasing the 
frame rate does not reduce the perceived SNR, and in fact improves picture 
quality by reducing ilicker. 

If one desires to have an MRTD equation which uses fundamental 
parameters rather than the NETD, one form derived by substituting NETD is: 


1/2 


sval g? (f)T2 Tq? sinc? (f/2f7) ar [#761 1/2 


MRID =—— y · (5.54) 
арАото D*(Ap) АТ % (ту! 


Finally, {һе MRTD expression of equation 5.53 сап be simplified for 
most system concepts to allow calculation of MRTD by hand. In many well 
designed systems, the displayed noise is white within the signal bandpass, or 


g? (Dr? T, ? = 1. (5.55) 


In that case, the expression for p! [2reduces to: 


f'T 1/2 
1/2 2| LL 
p ЕН (5.56) 


where Ғт has units of Hertz. To permit use of fy іп both places іп the equa- 
tion in units of [cy/mrad] we note that 


„_ Q 
о (5.57) 
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Then 
NETD 
3 — —R fT [z^ gi 
(Afg )!? Td 
MRTD = — (5.58) 
г; (T, PF)! 


This form has the desirable quality of using the NETD divided by the 
reference bandwidth. Thus the controversial part of NETD is eliminated and 
the more or less fixed parameters remain. 

For a sample MRTD calculation, consider the example system in its 
parallel scan implementation. The following MRTD parameters apply: 


NETDg = 0.36°C 


Afr = 29.4 KHz 

а = В = 1.0 mrad 

та = 2.67 X 107 sec. 
Е = 30 Hz 


MTF’s as given in Table 3.5. 


Using the MRTD equation for white noise and substituting the above 
parameters, 


fT 6 
MRTD = 0.49 — [^C] . 


Is 


The example system MTF and MRT are shown in Figure 5.7. 


5.9 Minimum Detectable Temperature Difference 


The MRTD concept is a useful analytical and design tool which is in- 
dicative of system performance in recognition tasks. However, MRTD does 
not necessarily correspond in any simple way with practical detection tasks. 
A function has been proposed by Sendall’ , Hopper? , and others which should 
correlate with noise-limited field detection performance. This function is the 
Minimum Detectable Temperature Difference (MDTD). The MDTD is not at 
present widely accepted and no conventions for it exist, but it is a useful 
concept. In deriving an MDTD equation, the usual assumptions about the 
natures of the system and the observer are made. The target is a square of 
variable side angular dimension W against a large uniform background. The 
MDTD is defined as the blackbody temperature difference required for an 
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Figure 5.7 МАТО and MTF for the example system. 
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observer to detect the presence of the square target when he is allowed 
unlimited time to make a decision and knows where to look for the target. 

Let the mean value of the displayed target image (a quantity which we 
assume the eye computes for spatial integration) be I(x,y)AT, where I(x,y) 
is the image of the square target O(x,y) normalized to unity amplitude, and 
where AT is the target to background temperature difference. Then the 
average signal-to-rms noise ratio per frame per line in the image is: 


(х,у) AT 
SNR, = —————————————— . (5.59) 


oo 1/2 
| eorex at 


NETDp | 2 


со 


|| g? (frg? df 
O 


Let the system line spread function dimension in the y direction be туі. Then 
for nonoverlapped scanning, the number of independent noise samples 
across the target image in elevation is approximately (W + Ігу 1)/B. 

Assuming the eye spatially integrates in the x direction over the dimen- 
sion W, the perceived signal-to-noise ratio is 


1/2 
Ј ORT at 
SNRp = SNR,CT E)! 2 | = 
|| g? (f)sinc? (WAR? Тр ? df 
(5.60) 


Substituting for SNR;, simplifying, setting SNRp = k, solving for AT and 
letting AT = MDTD yields 


1/2 


|| g? (f) sinc? (МО у 27,2 df 
O 
MDTD(W) =k ү NETDR 
|| g? (f) Fp’ df 
O 


I(x,y) (T F)!/2 Шаға 
, e p 


(5.61) 
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This may be cast as a frequency domain function by defining a funda- 
mental target frequency, 


-l 
fp = zl. (5.62) 
so that 
oo f 1/2 
|| g? (f)sinc? (ғ T. df 
О T 
| |NETD, 
|| 22 (Dig? df 
O 
MDTD (т =) = · (5.63) 
1 
—— — + Ir | 1/2 
T y 


(х,у) (T, F)!? 


Dryden? has noted that the MDTD can be related to MRTD by assuming 
that the square dimension W is the same as the bar width in an MRTD target 
and by rewriting the MDTD equation as: 


1/2 
1/2 — B 
k NETDg p m. 3] 


МПОІр----- ——— (5.64) 
(х,у) (ТЕ) 


Neglecting the term Iryl, we get 


V2 кМЕТРр oV? (fr)? 
MDTD (е - 1). V2 KNEDR 2 СТР). 


= ; 5. 
I(x,y) (ТЕ): 5:65) 
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Comparing this to the MRTD equation, we see that 


|] _Ts 1.5 V2 ( Е г) 
MDTD (tr = эу) "Wes MRID fT “зү (5.66) 


The difficulty of accurately predicting MDTD arises from the necessity to 
calculate I(x,y), which is the average value of a convolution integral. For 
targets much smaller than the detector solid angular subtense, I(x,y) reduces 
to the ratio of the target solid angular subtense to detector subtense. The 
MTDT equation for this case is useful for predicting detectability of targets 
which are effectively point sources. 


5.10 Noise Equivalent Emissivity 


The noise equivalent temperature difference is an idealization which 
allows one to calculate the video signal-to-noise ratio for a large blackbody 
target against a uniform blackbody background. Since scene apparent tem- 
perature differences may arise from emissivity variations as well as from 
contact temperature variations, it is useful to define an analog of the NETD 
which uses as the target a uniform emissivity difference in a broad area of an 
otherwise blackbody surface. Since the greybody area radiates less than its 
blackbody background at the same temperature, the video signal produced 
in the system by the greybody target will be negative relative to the back- 
ground level. Thus, the parameter we seek is the emissivity e which produces 
a video signal-to-noise ratio of -1 against a blackbody background. We may 
call this quantity the noise equivalent emissivity (NEE). As defined here, 
| /NEE gives the video SNR produced by a particular NEE. 

NEE is derivable by paralleling the NETD derivation. Assume that the 
target is shaped as shown in Figure 5.8. The difference between the target 
spectral radiant emittance and that of the background is: 


AW) = eW) (Tg) — Wy (Тв) 


(5.67) 
- (e — 1) Wy (Tp) - 
The change in spectral power incident on the detector is: 
(e — 1) Wy (Тв) 
Дру = ——*—>— Ao ов ro) (5.68) 
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Figure 5.8 Noise equivalent emissivity notation and target pattern. 


and the change in signal voltage is: 


(e — 1) № (Тр) Ag ag Va 
AV CON) Tere eec A) D¥*(A) . 5.69 
s (№) ab Af, то (A) D*(X) (5.69) 


Integrating over A, solving for AV,/V,, and setting it equal to -1 yields: 


Ao ep 


TN ab Afr A 


-1 =(є—1) W (Tg) D*Q) Ty (A) dA. (5.70) 


Solving for e and identifying it as NEE, 


TN ab Ағ 


NEE = 1 — n O (5.71) 


oo 


A, o || W (Tg) D*Q) To (А) dA 


О 
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Converting to the form of the NETD equation allows us to write 


“aw, (T 
9М CTB) DEN a 
oT D*A 
NEE = 1 - NETD = (5.72) 
Wy (Tg) 20У ах 


рор 


Hawkins? has defined a similar quantity called noise equivalent fractional 
emissivity. 


O 


5.11 Sensor Performance 


It is difficult to derive a single sensor performance factor which is 
meaningful for all types of designs and component characteristics because 
general performance factors do not always translate into subjective image 
quality or into information extraction capability. To simplify the problem 
somewhat, Sendall' makes the important distinction between those per- 
formance figures which are used for system synthesis and those which are 
used for system analysis. Whereas the NETD and MRTD measures previously 
derived are analytical, Sendall has suggested quantities which are useful for 
arriving at an efficient system design. 

The following derivation parallels that of Sendall for the broad class of 
systems whose sensitivity is determined by white detector-generated noise 
and whose resolution is largely determined by the detector angular subtense. 
It further assumes a circular clear aperture, contiguous non-overlapped 
scanning, single-RC low pass filter electronic processing, and square detectors. 

The equations appropriate to those assumptions are 


"ӘЗ, (T 
МЕТР = 4 Vab Afp | E p? || M. в) ко ZI (5.73) 
O 


т 1 
Mg = =. (5.74) 


and 


та = nabn,./ АВЕ (for the case of a parallel scan system). (5.75) 
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Then 
Дер = т АВЕ/4павт, с > (5.76) 
and 
2 v T ab АВЕ /поВ 0, с 
NETD = 


aw» (T 
o. D? || в) D*A) ToCA) dÀ 


О 


2УтАВЕ F/# 


- — За 7” (5.77) 
аруа те zi р D*O) ToCA) dA 
O 
Now define the sensor performance P as 
OW) (Тр) 
| V n Nse Бо oT D*A) To(A) ах 
Р = (АВЕ! о (5.78) 
af МЕТІ) NEU 


This is a good summary parameter because NETD/F!/2 is indicative of 
thermal sensitivity, (AB) /2? is an equivalent square field of view dimen- 
sion, and of is indicative of resolution performance for many systems. 
Thus reducing NETD/F!/ 2, increasing (AB)!/ 2. and reducing of increase 
performance. 

For a BLIP detector with theoretically perfect cold shielding? 


D* = 2 Овур F/# (nq Nes)!” (5.79) 


and we can write 


со 


Мп Neo OW) (Тв) 
Пас D, (Mes Ng)!” т. D(A) dx. (5.80) 


P- T 
Мт Д BLIP 


T Assuming that the detector operates at the same focal ratio as the optic, i.e., that there is no pupil 
migration with scanning. 
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If we now let т, (А) = To and define 


Vn D, 


Sensor design complexity = C A ———— (5.81) 


Мт 


Sensor efficiency = E А To Nes Пас Па) (5.82) 


and 


oo 


Sensor radiation function = R A | D** (X) dÀ, (5.83) 
о BLIP 


then P = CER. The interpretations of E and R are straightforward, and С 
shows that after E and R have been optimized, further improvement in per- 
formance can be achieved only at the expense of a wider aperture or more 
detectors. 

If the detector angular subtense does not dominate the sensor MTF, it 
is desirable to substitute for a the equivalent resolution r, in the scan 
direction 


nc Bl Lo at | uu (5.84) 


where Ty is the system MTF in the scan direction. Similarly defining гу, the 
performance equation becomes 


A B Е 1/2 
„АВРО . (5.85) 
rx ry, NEID 
As a final remark, it should be noted that the above definitions of per- 
formance clearly emphasize the constancy of performance with increasing 
frame rate. 


5.12 Spectral Region Selection 


The most desirable spectral region for FLIR operation is the one which 
maximizes performance for a given cost. For two FLIRs identical in all re- 
spects except for spectral region, their NETD's are roughly indicative of 
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relative performance except for the effects of atmospheric absorption. The 
best possible case for both spectral regions is that of BLIP detectors with 
NETD given by: 


МАГЕ 
МЕТ) = ——————————À. (5.86) 


eT AW 
D, VAP ncs tq DBLIP (Хр) To AT 


А 
aw. | Ан) р" О) gy 
~ . 
А, oT D (Ap) 
From Planck’s distribution for long wavelengths and terrestrial 
temperatures, 


dW» (Tp) ~ Со 
9T т> 


W (Тв) . (5.87) 


Since for perfect photodetectors, 


AD* (Ap) 
D*A) =—-— for Хе А 
5 


= 0 for A > Хр, (5.88) 


we have that the integral in equation 5.86 reduces to 


C 
— = —5 W, ал. 5.89 
АТ “ТВ А, А ( ) 


200 Chapter 5 


Then assuming that all other parameters are equal the ratio of the 
NETD's in two spectral regions is given by 


Х4 
A; DBLP (Aa) | W dà 


МЕТРО эм) 8 , (5.90) 
NETDA; ЭА ) | 


№, 
№ Dgrip (№) | W dr 
А 


1 


апа опе may now compare two theoretically perfect systems which аге 
identical except for spectral region. For BLIP operation in the 3.5 to 5 and 
the 8 to 14 regions, theoretical Dprp’s are: 


DIIP (14 um) = 3.27 X 1019 [cm Hz? /watt], 


and 


Dap (5 um) = 1.09 X 10! ! [cm Hz? /watt]. 


À 
The values, / PW,dA are obtained from a radiation slide rule and 
yield 1 


NET (3.5-5) _ 5 X 3.27 X 1019 X1.72 Х 102 _ 365 
МЕТ (8-14.5) 14 x 1.09 X 10!! X 5.56 X 1074 mE 


This result shows that a perfect FLIR operating in the 8 to 14 um region will 
perform better than one operating in the 3.5- to 5-um region by a factor of 
3.65 when all other design parameters are equal, and when the atmospheric 
transmissions are the same. 


5.13 Spectral Bandpass Optimization 


The choice of the approximate spectral range in which to operate is 
normally dictated by economic constraints; one conceives the possible sys- 
tems which perform the desired function, and compares their estimated costs. 
Equivalence of performance of the candidate systems may depend strongly 
on the atmospheric transmissions expected under the anticipated conditions 
of use, and that effect may dominate the sensor cost, as for example in the 
case of 3 to 5-um versus 8 to 14-шп sensors for moderate range usage. 
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Once the coarse spectral region has been selected, the problem of de- 
termining the functional form of the spectral transmission is separable into 
two cases. In the first case, the sensor noise is not background-limited, but is 
dominated by generation-recombination, thermal, and excess noises. The 
problem then is to choose a spectral response which suppresses undesirable 
spectral elements such as sunlight or expected countermeasures, or which 
emphasizes desirable target characteristics. In general, this is not an involved 
procedure. 

In the second case, the sensor is background-limited, so that the de- 
tector background and the atmospheric transmission determine the detector 
noise. Thus, in peculiar atmospheres, or in severely signal-to-noise-ratio 
limited applications, it may prove desirable to maximize the sensor SNR by 
using a cold filter which immediately precedes the detector and which has an 
appropriate spectral response. 

We recall from previous sections that 


7 aw, (Т 
SNRe f TA) тобу D*Q) М, (5.91) 


о 


where T, (A) is the atmospheric transmission. If a cooled spectral filter is used, 
it is useful to separate the optical transmission into an uncooled part 
Toy(A) and a cooled part то ,(A). Then 


B aW (Tg) 
SNR« Ta) то, (X) To (X) р) dÀ. (5.92) 
О 
For BLIP performance, we recall that 
жж T 1/2 
D*(A) = Dprip (A) та” (A) Ea . (5.93) 
5 

The BLIP limit for photoconductors is 

Difp (0) = —~—_. (5.94) 

2һсОр 1/2 

where when а cooled filter is used, 

Ов = || Toc(d) Qd . (5.95) 


о 
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Combining all of these relations, 


- 4,20) 2 Tw EB | 
Qes 
SNR« || та (Х) Tou (X) о). -----р-------4 
O 1/2 
2hc | | Тос(А) Өл | (5.96) 
As 
О, = XW (5.97) 
he 


and dropping constant factors, 


59 Т 
7,0) Tou) 19,0) 04/20) EB) ал 


SNR« о ________ ~ LL (5.98) 
оо | 1/2 
| [ Toc(A) A W) ве) | 
O 


where Твс is the absolute temperature of the cooled background. 

The problem of maximizing the ratio is now reduced to appropriate 
selection of the cold filter то (A). Kleinhans! 9 found the necessary (but not 
sufficient) conditions which maximize the SNR, and showed that the op- 
timum cooled spectral filter is binary at each wavelength. 

Kleinhans' analysis is applicable to systems for which 


f тос(№ PTA) dà 
SNRa О 


oo 1/2 
| f Toc) PBA) a| 
О 


(5.99) 
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where ®7(A) and Фр(А) are target and background spectral quantities, 
respectively. Using variational principles to maximize (S/N)?, using the 
constraint 0 < то (X) < 1, and denoting the optimum filter by 75,(A) such 
that the optimum SNR is 


А | Toc(A) PTA) dà 


So | (5.100) 
No 29 1/2 
| Toc 9 gA) da 
Kleinhans proved that 
| PTA) 
T oc (A) = 1 where) 5 80) Ub (5.101) 
and 
PTA) 
T oc = О when == Spr) <x (5.102) 
In our case we have 
AT B) 
ФтО\) = T4(X) Tou (X) а,” (№) == > (5.103) 
апа 
PBA) = АМ, (Тр). (5.104) 


The numerical integrations required to find the optimum filter тау be 
prohibitively time-consuming unless the form of the filter is approximately 
dictated by such effects as sharp spectral cut-on’s of cut-off’s due to the 
atmosphere, the optical materials, or the detector. In that case, the filter 
may take a bandpass form, and the solution for the optimum filter is 
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relatively simple. Since Kleinhans’ proof does not assure sufficiency, it is 
possible that the filter resulting from blind application of the technique may 
be less optimal than a conventionally-chosen filter. 


5.14 Infrared Optics Collection Efficiency Factors 


Often it is useful to define and calculate various optical efficiency 
factors for use in sensitivity equations. The fundamental property of a lens 
system is the transmission function T(x,y,0,X) which describes the attenua- 
tion of a ray of light of wavelength \ making an angle 0 with the optical axis 
and directed at lens coordinates (x,y). We will assume that T(x,y,0,A) is sep- 
arable into a real (as opposed to complex) pupil function p(x,y) and a spectral 
transmission T,(0,A) , 


T(x, y,A) = pGxy) T9 (0, ) . (5.105) 


The effective collecting area A, of the optic is 


- | p(x,y)dx dy (5.106) 


and average spectral transmission T, for a particular field angle 0 may be 
defined by 


hi 
ти) = 1 | тоела (5.107) 
ға 


1 


Ап effective spectral transmission тау be defined by 


°° OW) (Tp) 
| т(\) D*Q) aa ах 
n= 2 T WR т | | (5.108) 
І D*Q) — E А Br dy 


However, for a background limited detector, 7, is not a good figure of 
merit because atmospheric attenuation removes many spectral signal com- 
ponents, thereby reducing the signal-to-noise ratio. 
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A more appropriate quantity is the spectral efficiency 7: 
oW Ші ) 
[ 7909 таб) Do) AE ax 


Ts = o9 
ОМ (T 
[ таб) бә 7-5 M чу dA 


(5.109) 


5.15 Relating Performance Parameters to Operational Effectiveness 


Specifiers and designers of thermal imaging systems must be able to 
select system parameters which will satisfy the performance requirement at 
hand. The intent of this section is to outline a procedure which will reason- 
ably ensure the specifier that his performance requirements will be met. The 
experiences of military thermal imaging developments are enlightening in this 
respect. 

Many rules of thumb which were approximately valid in the early days 
of military thermal imaging do not now apply to the full range of current 
applications. For example, when most system designs followed the same con- 
vention, as in parallel-processed systems, the detector angular subtense and 
the display element equivalent angular subtense had approximately the same 
value and dominated the system MTF. 

Consequently one could speak of a "resolution", meaning detector 
angular subtense, of one milliradian, and imply a great deal about the 
system's performance. The systems were relatively coarse in resolution and 
possible recognition ranges were short. Consequently if one knew that a sys- 
tem with a resolution of so many milliradians recognized an object at so 
many meters then one could scale up or down in resolution and confidently 
infer the resulting recognition capabilities. Implicit in this practice were the 
assumptions that thermal sensitivity was unchanged, atmospheric absorption 
was unchanged, and atmospheric MTF effects were negligible. 

When ranges were extended as systems improved, and as rigid design 
conventions were dropped, the latter two assumptions were violated, recog- 
nition failed to scale with resolution and resolution in the old sense of the 
term became much less meaningful. Consequently, new performance param- 
eters and more complex performance theories had to be developed. 

The explosion of FLIR technology has immensely complicated the 
problem, so that thermal systems are now as elusive to define and standardize 
as are thermal scenes and observers. Two nominally equivalent systems may 
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have different signal transfer functions, raster effects, frame rates, and mag- 
nifications. Performance is sensitive to these variations, and it is common- 
place to find in a competition that one system is preferable to another even 
though both are nominally the same. 

These uncertainties render absurd such seemingly accurate statements as 
“with these performance specifications and parameters the system will dis- 
tinguish a man from a bush at one hundred meters in moderate clutter on a 
clear warm night". Another observer on another night using a similarly 
specified system under similar conditions probably will not duplicate that 
performance. However, users frequently must express their needs in terms of 
such statements and suppliers must then translate the ambiguities into 
specific systems parameters. The danger of this practice is that it is easy to 
overdesign by specifying system parameters to an accuracy greater than that 
to which the problem is describable. 

Any thermal imaging problem statement is affected by the forty or 
more factors discussed in Section 10.1 which influence search performance. 
Some uncontrollable variables which exhibit wide variations are target and 
background characteristics such as operating state, clutter, context, contrast, 
atmospheric transmission and MTF, and radiation exchange between the 
target and its environment. In addition, those variables which are control- 
lable by the observer will vary around nominal positions and affect per- 
formance. Some examples are surround luminance, sensor and display focus, 
and brightness and contrast settings. Other factors which are less under the 
control of the observer and which have enormous impact on performance are 
motivation, frame of mind, and information extraction skill. 

Numerous procedures exist for predicting the field performance of 
thermal imaging systems. However, given the strong variables in any tactical 
situation, it is the author’s opinion that no thermal imaging model will ever 
accurately predict a priori the results of a field test. Successfulex post facto 
theoretical explications of field test results should not obscure the fact that 
target acquisition is too complex a phenomenon to measure completely, 
much less to predict accurately. This is quite clear when the variations in the 
predictions of two widely differing models fall within the envelope of the 
measured performance variations. 

System specifiers, nonetheless, must know what kind of system is 
needed to perform a tactical function, and cannot always confidently rely on 
a paper analysis of a problem. Specifiers also understand from experience 
approximately how sensor variations affect field performance, and may be 
reluctant to use theoretical constructs rather than their own judgment. The 
practical solution to this dilemma is to perform a field test under the condi- 
tions of interest with a sensor having characteristics similar to those which 
are suspected will do the job, and to extrapolate up or down in sensor per- 
formance to get the desired performance. 
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This approach is practical because standard imagery and empirical data 
are readily available which indicate the effects of differential performance 
changes. This is not to say that we should not continue searching for simple 
performance summary measures which correlate with field performance, but 
that we should.not rely on analysis alone. 

Basing system design on extrapolations of field data is impractical, 
however, when all system parameters continue to be treated as design variables. 
This makes it difficult to assess the effect of varying one or two parameters 
when the others are not constant. Those who specify systems can have simple 
design criteria only if some variables are fixed. The following conventions are 
suggested to make it possible: 


l. fix the per frame, per resolution element, displayed МЕТР at 
0.2°C except for the most severe applications. 

2. require image uniformity at least as good as commercial television 

quality, and preferably with no raster and no fixed or moving 

pattern defects. 

fix the effective frame rate at 30 fps. 

fix the magnification at an optimum value of aM = constant. 

do not allow a thermal maximum dynamic range of less than 20°C 

above ambient. 

6. require system MTF's to be better than or equivalent to a gaussian 
MTF with a sigma of one-half of the detector angular subtense. 


MW Bw 


These conventions are in some cases already inherent in good FLIR practice. 


5.16 Summary of Primary FLIR Image Quality Analysis Equations 


It is useful at this point to summarize what has been proposed so far 
about FLIR image quality measures. Two functions and one parameter are 
generally believed to provide a good first-order estimate of thermal imaging 
system quality. The functions are the minimum resolvable temperature dif- 
ference (MRTD) and the modulation transfer function (MTF), and the param- 
eter is the noise equivalent temperature difference (NETD). The MTF and 
summary measures derived from it correlate well with the subjective im- 
pression of image sharpness in noise-free imagery. The MRTD describes four- 
bar resolving power in noisy imagery and is related to target recognition 
capability. The NETD is a convenient summary measure of broad area 
thermal sensitivity, but is not related in a simple way to any practical task or 
subjective correlate of image quality 

A typical FLIR has five component contributions to the overall system 
MTF: the optical blur, the detector geometry, the detector electrical fre- 
quency response, the electronic signal processing response, and the display 
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blur. The optics in most FLIR designs are diffraction-limited on-axis, and 
usually have circular unobscured apertures. For this case, and for entrance 
pupil diameter D, and mean wavelength A, the MTF is: 


ғ, =Z nes (5) - (>) | - and | (5.110) 


where f. = D,/A is the absolute cutoff frequency in cycles/radian. 
The detector responsivity contour is usually approximated by a rectan- 
gular function of angular dimension а in the scan direction. The MTF is: 


~ _ sin (raf) 
d^ c (5.111) 


The frequency response of FLIR detectors usually behaves like that of a 
single-RC circuit with characteristic frequency Ё; given by 1/(2т rg), where 
TR is the responsive time constant. The MTF is: 


~ f А -1/2 


Electronic processing MTF’s are composed of contributions from pre- 
amplifiers, post amplifiers, boost circuits, and video processors. For analysis 
purposes one usually assumes an overall electronics processing MTF of unity 
within the bandpass, where any rolloffs have been compensated by the boost 
circuit. 

The video monitor is usually a TV-type monitor and is adequately 
represented by a gaussian MTF with a monitor spread function standard 
deviation oq, related to the shrinking-raster line separation s by 


Om 7 0.54 s. (5.113) 
The MTF is 


Tm = exp (–2п 0m f? ). (5.114) 
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The reference NETDg for the case of a clear circular aperture and a 
rectangular detector is given by 


_ 4Vab Аїр 
NETDg зе (5.115) 


(Тв) D*A 
Vn ав DD Ap fr oo Seen 
О 


where a and b are the detector linear dimensions, а and f are the detector 
angular dimensions, Dj is the optic diameter, Afp is the reference elec- 
tronic equivalent noise bandwidth, andn is the number of integrated detec- 
tors in the case of a serial scanner. For a parallel processor,n = 1. The noise 
reference bandwidth Дїр is given by 


Дер = | 22 (f) Tg? df, (5.116) 
О 


where g(f) is the noise voltage spectrum into a reference filter having ап 
MTF of Tp. 

The MRTD is predictable by many similar and equivalent equations. 
One of these is 


1/2 
%22 
3 МЕТРр Г 5 “(ут 215іпс? (f/2fq)df ЕСЕ 
MRTD = Afr (5.117) 
[^ (T, F)!/2 


where fT is the MRTD target frequency in appropriate units, Te is the eye 
integration time, and F is the frame rate. 

For the special case of a gaussian system MTF and component MTF's 
such that there is white noise on the display, this MRTD equation reduces to 


MRTD = Ак (fp [cy/mr]) (а/та) ? g! ? 
(Дғр )!? 


exp (27? og fp?) (T,F) 32 (5.118) 


where Tq is the detector dwelltime. 
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The primary usefulness of the MRTD function is that it is an excellent 
summary measure which indicates both low frequency thermal sensitivity 
and high frequency limiting resolution. MRTD provides a good method for 
comparing systems because it includes the effects of noise, resolution, tem- 
poral integration, spatial integration, and spurious image defects. In addition, 
MRTD should correlate well with practical tasks such as target recognition 
and identification in noisy imagery. MTF, on the other hand, correlates 
with recognition and identification performance in noise-free imagery. De- 
pending on the task, the integral of the MTF, the integral of the square of 
the MTF, or the MTF 5 percent modulation frequency may correlate well 
with performance. 


Performance Summary Measures 211 


REFERENCES 


ү К. Weihe, “Classification and Analysis of Image-Forming Systems”, Proc. IRE, 47, pp 1593-1604, 
September 1959. 


21 А. Hawkins, “Generalized Figures of Merit for Infrared Imaging Systems", Report DRL-TR-68-12, 
Defense Research Laboratory, University of Texas at Austin, February 1968. 


3R.D. Hudson, Infrared System Engineering, Wiley-Interscience, 1969. 


МЕ. Soboleva, “Calculation of the Sensitivity of ап IR Scanner", Soviet Journal of Optical Tech- 
nology, 37, pp 635-637, October 1970 (English Translation). 


?O.H. Schade, Sr., “Ап Evaluation of Photographic Image Quality and Resolving Power", JSMPTE, 
73, pp 81-120, February 1964. 


6R, Genoud, personal communication, Hughes Aircraft Company, Culver City, California. 

7R.L. Sendall, personal communications, Xerox Electro-Optical Systems, Pasadena, California. 
865. Hopper, personal communication, Texas Instruments, Inc., Dallas, Texas. 

9E. Dryden, personal communication, Aerojet Electro-Systems Corporation, Azusa, California. 


10W A. Kleinhans, “Optimum Spectral Filtering for Background-Limited Infrared Systems", JOSA, 
55, pp 104-105, January 1965. 


CHAPTER SIX — OPTICS 


6.1 Optical Elements 


The simplest categorization of an optical system is the distinction be- 
tween a refractive and a reflective system. The selection of a reflector, a 
refractor, or a combination of both (a catadioptric or catoptric) for a 
particular requirement is complicated by the fact that each type has merits 
and faults. 

For a specific lens diameter and focal length, the reflection and absorp- 
tion losses of a refractor will be greater than the absorption losses of a re- 
flector, but the overall collection efficiency of the reflector may or may not 
be better depending on the amount of central obscuration. A reflector tends 
to be lighter in weight and less expensive than an equivalently sized refractor, 
but the reflector may not have the same image quality. For example, 
spherically-surfaced reflectors cannot be corrected for off-axis aberrations 
without use of a refractive corrector plate, whereas spherically-surfaced re- 
fractors facilitate aberration-balanced optical designs. Since spherical sur- 
faces are less expensive to produce than aspherics, this is an important 
consideration. On the other hand, refractors suffer from chromatic aberra- 
tions, while reflectors do not. The prevailing bias in FLIR design is in favor 
of refractive systems since so many ingenious concepts exist which enable 
good quality to be achieved in a compact package. However, the cost ad- 
vantages of reflectors make it necessary to re-evaluate that bias each time a 
new optical system is conceived. 

Beyond this simple distinction it is difficult to discuss optics inde- 
pendently of scan mechanisms because the two are closely related, the type of 
scanner chosen dictating the optical design to a great extent, and vice versa. 
Thus a brief discussion of scanners is in order. There are two types of scan- 
ners: convergent beam scanners and parallel beam scanners. 
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A convergent beam scanner operates in a part of the optical system 
where rays are converging, or approaching a focus. A convergent beam 
scanner typically is placed between the inner-most focusing refractor and the 
detector array. A parallel beam scanner operates in a part of the optical sys- 
tem where the rays are parallel or collimated. If the scanner is located in 
front of the optical system, the term "object plane scanner" applies. A 
parallel beam scanner may also be placed between a relay lens pair. Two 
examples are shown in Figure 6.1. 

Which type of scanner is preferred for a particular application depends 
on what trade-offs can be made. A convergent beam scanner generally re- 
quires fewer elements in its optical system but the scanner introduces 
aberrations and distortions into the convergent beam which must be 
corrected. 

A parallel beam scanner generally requires a more complicated optic, 
but image quality in a parallel beam scanner is not as sensitive to deviations 
from scan mirror flatness as is image quality in a convergent beam. An object 
plane scan element must be as large as the maximum projected dimension of 
the entrance pupil, so it is usually desirable to confine parallel scanning to 
the interior of an optical system. With these distinctions in mind, we may 
proceed to the discussion of optical systems. 


6.2 Basic Optical Processes 


There are two idealized processes of interest to us here: refractionat the 
interface between two nonconducting transparent media as in a refractive 
lens, and reflection at the surface of a perfect conductor as in a mirror. Con- 
sider Figure 6.2, which shows an interface between two nonconductors. 
Each material is characterized by a parameter called refractive index n, the 
ratio of the speed of light in the material to the speed of light in a vacuum. 
For nonmagnetic materials, n is the square root of the dielectric constant 
evaluated at the optical frequency of interest. The refractive indices of the 
useful thermal imaging materials are given in Section 6.8. 

Figure 6.2 demonstrates Snell'S Law of refraction, which is that for 
angles of incidence 0 and of refraction 0' measured from the surface normal, 
and for refractive indices n and n' 


(6.1) 


Or 


? 


0' = arcsin (2 sin jJ (6.2) 
n 
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(a) 


(b) 


Figure 6.1 (a) A parallel beam and (b) a convergent beam scanner. 
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Figure 6.2 Refraction. 


A simple way to remember this effect is to note that the refracted ray is bent 
toward the normal when n' n. 

Reflection by a perfect conductor (a mirror) is demonstrated in 
Figure 6.3. The law of reflection is that 


0-60” (6.3) 


6.2.7 Тһе Wedge 


А useful optical component is the refractive wedge depicted in 
Figure 6.4. The wedge apex angle is W, the angle of the incident ray is 4), 
the normal to the incident surface is Мі, the angle of the exitant ray 15», 
and the normal to the exitant surface is N,. The indices from left to right 
are n,,n,,n3. The ray deviation produced by a wedge is derived as follows. 
With reference to the symbols defined by Figure 6.4, using Snell's law at the 
first surface gives 


sin (Ф, — КМ) Dm 


ѕіп (фу = №) no’ (6.4) 
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Figure 6.3 Reflection. 


and the second surface gives 


sin (фз — №) _ 12 


п (Ф —N2) n (6.5) 
Combining 2quations 6.4 and 6.5, 
nj , n, . 
sin (фз — №) =—— sin (6; — №) ^u sin (6; — Ni — W) 
3 3 
n2 , 
=- sin (9 — Ni) - W] 
3 
n, , @ n, , 
=— sin (arcsin E sin ($4 — М, | — wi. (6.6) 
n3 П; 
Then 
nj . [m , 
фа = arcsin E sin (arcsin E sin (ді — М, | —W | +N,. (6.7) 
3 2 
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М, =N, +W 


n N 


Figure 6.4 Refraction by a wedge. 
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For small (ді — N, ), for n, >n,, and for small W, 


n 


1 n2 
фз = (Ф. - Ni) – = W + N,. 
3 


n3 
Бог п, = п; = 1, 
дз =o, — W (n; — 1), 
and the ray deviation e is 
є=фз —¢, = —W (n; - 1). 


For germanium with n, = 4.003, 


(6.8) 


(6.9) 


(6.10) 


| , . ] , 
Q4 = arcsin {4.003 sin (arcsin оозе (ф, — №, | - vj + М,. 


For W = 10 degrees and an equilateral wedge, 
N, = —5 degrees, N, =+5 degrees, and 


фз =arcsin |4003 sin| arcsin [.2498 sin (ф, + 5°)] — 10°} 


+ 5° 


~ ф, — 10° (3.003) = ф, — 30.037. 


Selected values for this case are: 


Фф, фз exact 
0 -32.526° 
5° 226.564 
10° -21.026 
15° -15.840 


03 


approximate 


-30.03° 
-25.03 
-20.03 
-15.03 


Thus the approximation is not very accurate even for these small angles. 

A wedge is a useful component because rotation of the wedge with a 
selected exitant ray as the axis causes the incident ray in object space to 
scan in a circle. This property may be used in scanners, as discussed in 


Section 7.5. 
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6.2.2 The Simple Lens 


The simplest possible refractive focusing optical system consists of a 
single spherically surfaced lens defined as shown in Figure 6.5. The simple 
lens is characterized by four numbers: the radii of curvature К, апа К,, the 
refractive index n, and the thickness t. The usual sign conventions are that 
the sign of the radius of a surface is positive when the center of curvature 
is to the right of the surface, and that the sign of the radius is negative 
when the center is to the left of the surface. R, is the radius of the left- 
most surface and R, is the radius of the rightmost surface. 


REFRACTIVE 
INDEX n 


Figure 6.5 A spherically-surfaced simple lens. 


The simple lens is called a thin lens when the thickness t is negligible. 
We are interested in the three modes of imaging of the thin lens which are 
shown in Figure 6.6. In the first mode, parallel rays of light from an infinitely 
distant point source are focused to an image at a distance f behind the lens. 
The quantity f is called the focal length, and to a first order approximation f 
determines the shift s in the image position of an infinitely distant object due 
to a small angular beam shift o, 


s=of, (6.11) 


as shown in Figure 6.7. 
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FOCAL PLANE 
(CURVATURE = 15 cm) 


R, = 5.667 ст 


| f=5cm 


Figure 6.7 Focal property of a practical infrared lens. 


In the second mode, a point source at a distance f in front of the lens is 
converted to a beam of parallel rays leaving the lens. In the third mode, a 
point object at a distance O neither at infinity nor at f is imaged as a point at 
distance i given by the thin lens equation 


14101 

11-І, (6.12) 
where 

A 1 1 

12 n- 1) (+-+ (6.13) 

It2 (n I x) 
Or 

В.В 
———————— 2 ___ (6.14) 


(n— 1)(R; — К)) 


This relation is derived in Section 6.4. 
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The importance of the magnitude of the refractive index of a potential 
lens material is evident from this equation. The higher the index, the larger 
the radii may be for a particular focal length. This makes it easier to manu- 
facture and to uniformly antireflection coat the lens, and makes the lens 
thinner, giving less absorption loss. 

Figure 6.8 shows all of the possible cases of a thin lens where a lens is 
considered positive or convergent when an incident parallel beam is focused, 
Or as negative and divergent when an incident parallel beam diverges. Note 
that two of these lenses are reversals of two others, so that there are only six 
thin lenses. The focal length of a thick lens with thickness t is 


R,R | 
f£ 2———————— ______. (6.15) 


(n — 1) (Б —R,)+ ww 


The change Af in lens focal length from the zero thickness focal 
length fọ with increasing thickness is readily shown to be 


fo 
Af = — —— (6.16) 


t . 
(К, -- R, Jat | 


The lens law for the thick lens is derived in Section 6.4. 


6.3 Generalized Lens Systems 


Thick lenses and combinations of lenses are conveniently described by 
using an equivalent fictional system in which all refractions occur at two 
planes, called the first and second principal planes, shown in Figure 6.9. The 
intersections of the principal planes with the optical axis define first and 
second principal points H and H'. A collimated beam incident from the right 
comes to a focus at the first focal point F, and a collimated beam from the 
right focusesat the second focal point F'. The distances from the focal points 
to the principal points are the first and second focal lengths f and f' as shown 
in Figure 6.10. 

When object space and image space have the same index, and if object 
distance O is measured from H, and image distance i is measured from H', 
the lens law remains 


Ll. (6.17) 
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А, R2 


------------- |В,| = В. | = 
f (n—1 (R, Вл)“. 11= n, ЈА, | = 6 


For В, > 0, А, < 0 


ry f5 


Бе) (ri tr) 


> 0 (converging) 


double convex 
° or plano convex or 


For В, >0, В, >0 


ry f2 
f= — 
(n — 1) (г) — ri) 
г >г,,Ғ>0 (converging) гі >г;,Ғ<0 (diverging) 
| , plano 
meniscus or meniscus concave 


For В, <0, К >0 


LAE <o (а 
f = — — f <0 ivergin 
(n — 1) (ry +) ging 


double or plano 
concave concave 


Figure 6.8 Simple lenses. 
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Figure 6.9 Principal points and planes. 
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Figure 6.10 Image and object distances for a generalized lens system. 


When object distance is denoted by x and is measured from F, and image 
distance is denoted by x’ and measured from F', the lens law becomes 


ҒҒ-хх. (6.18) 


The locations of the principal planes are calculated as described in Section 6.4. 

An optical system typically is characterized by three parameters: the 
field of view, the effective clear aperture diameter, and the effective focal 
length. The field of view of an optical system consists of the shape and 
angular dimensions of the cone in object space within which object points 
are imaged by the system. For example, a field of view (FOV) may be 
described as being rectangular with dimensions of 4 degrees wide by 
3 degrees high. The FOV is usually defined by a field stop, an opaque aper- 
ture placed somewhere in the optical system to block rays which originate 
from outside of the desired FOV. Figure 6.11 shows two typical field stops. 
The FOV of a FLIR system, however, is not always determined by the FOV 
of its lens, because the scan angles or the portion of the video selected for 
display may further limit the field coverage. 
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Figure 6.11 Typical field stop arrangements. 


The entrance pupil consists of the shape and linear dimensions of the 
area over which the optical system accepts radiation. It may be a real aperture 
or a projection of an aperture stop inside the system. The ratio of focal 
length to clear aperture diameter D, is defined as focal ratio or F/number, 


F/£ = f/D, - (6.19) 


If a small radiation-detecting element of dimension “а” is placed at the focal 
surface, the angular projection о of this detector through the optical system 
is given by 


«=. (6.20) 


6.4 Matrix Representation of Lens Systems 


The principle of operation of an optical system may be perceived 
rather quickly by performing a geometrical raytrace using the matrix repre- 
sentation summarized here. Klein! is a good source for this technique. Any 
ray is completely characterized at a point (x,y,z) by prescribing the projec- 
tions onto the x, y, and z axes of a unit vector tangent to the ray at the point. 
These projections are called direction cosines because they are the cosines of 
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the angles the unit vector makes with the axes. The direction cosines are the 
components of this unit vector and are denoted by (l,m,n), as shown іп 
Figure 6.12. 


RAY PATH 


Figure 6.12 Direction cosines. 


Raytracing is the process of following an incident ray specified by 
(x, у, 2; 2, m,n) through a system until it exits аѕ(х', у’, 2; 2’, m’, n). Ап 
all-refractive system is easy to analyze when attention is confined to merid- 
ional rays, those rays contained by a plane which includes the optical axis. 
Then any incident ray can be considered in a coordinate system in which the 
ray motion is confined to two directions. Since all the ray changes consist of 
discrete refractions and of translations, the two directions are independent of 
each other. Thus we may consider ray motion in x together with its direction 
cosine &, y with m, and z with n. In a reflective system, however, x апа £ de- 
pend on y, z, m, and n because a reflection couples the coordinates. 

Figure 6.13 shows a ray translation from a point y, to a point y, 
with constant direction cosine. This translation is represented by 


fy l di2 Yı 
= | (6.21) 
m, 0 ] m, 
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Or 
Уз =yi + 41; M; (6.22) 
and 
m, = №. (6.23) 
y 
У2 
942 
m; 
Yi 
ni 


Figure 6.13 Ray translation. 


Refraction at a surface of radius R going from index n to n' was shown in 
Figure 6.2 and is given by 


y 1 0 у 
— (6.24) 
m’ —p n m 
BEN п! 
where р is called the power of the surface and is given by 
p ==. (6.25) 
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Refraction by a thin lens going from index n, to lens index n, to 
index n4 as shown in Figure 6.14 is described by: 


y2 | 0 yi 
_ , (6.26) 
1009; — p ni m, 
n n 


where p is called the power of the lens and is given by 


Ny = П йз = П 
В, В, 


(у, ті) п =4 RADIUS = 10 ст 


пз = 1 
(уз, m2) 3 


RADIUS = 15 ст 


Figure 6.14 A numerical example of refraction through а lens. 


The action of a thin lens on rays terminating a distance d to the right of 


the lens is* 
| | | С | ) 
0 1 р 1 m 


ш — dp) + dm 


(6.28) 
m — py 


*Recall that matrix multiplication is associated but not commutative, so the sequence of the matrices 
is significant. 
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8.04 cm (ya, тд) 
= у2 = 2 ст уз = 1.85 уд = 1.85 BACK-FOCAL 
У = 20m 2 3 LENGTH = 8.04 
M,=0 М2 = —.15 Ма = —.15 M4 = —.23 
X = gg X = ~13.3° f — 8.687 


Figure 6.15 A ray trace calculation for an on-axis object at infinity. 


For a ray parallel to the optical axis (that is, from a source at infinity), 
т= 0 and у =y—dpy. The image ray intersects the axis (у = 0) when 
а = 1/р. This distance is called the focal length f and is given by 


no -n n4 —n 
үйне Т к (6.29) 


In air this becomes 


Hele) = 


Figure 6.15 shows a numerical example of a simple lens ray trace. 

The action of a simple lens may be derived from the basic processes of 
refraction and of translation as follows. As shown in Figure 6.16, let the light 
be transmitted from left to right, passing through indices n, , n5, and пз. Then 
the matrix equation is 


у» 1 0N у t 1 0 y 


ms -P2 П; 0 l -Pı П; 


Пз n3 169; П2 
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where 
n, -n n, — n 
pi = = and р = : R, . (6.32) 
1 
This reduces to 
pıt nı 
__ ——t 
Уз 1 n; n; 
yı 
_ е . (6.33) 
m P2 pi рур; n, пр 
3 = — — = — __ 
n3 1; 0203 n3 02 3 
(yi, mi) 


(уз, m3) 


в 


Figure 6.16 Notation for the derivation of the matrix equation 
for a simple lens. 


Thus the power py of the lens is given by 


t 
рре- B BE (6.34) 


n3 N3 П n3 
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and the focal length f by 
1 Pi р; Рр! 
— = = + — ee 
f пз 0з пп» 


n -n ny — Пз (m = 0) (02 - na )t 


E n3R, o R + К, К, п, п, (6.35) 

If the lens is іп air, п, = n4 = 1 and 

1l. 1 1 | Сб — DDt 

f7 (M2 -D it К, + К, К, | (6.36) 
When the thickness is negligible, 

1 _ i l 

f (п, — 1) (x; ш К; | , (6.37) 
as before. 


Klein! shows that the thick lens may be treated as a thin lens by finding 
the principal planes, translating rays into and away from these planes, and 
using between the principal planes the matrix 


1 0 
-l n (6.38) 
п; йз 


where f is as given before. The first principal plane is located a distance A, 
from the apex of the first surface given by 


m (M-m) tf (= (6.39) 


І 153 В, na 


where A, > 0 indicates location to the right and A, < 0 means location to 
the left. The second principal plane is located a distance A, from the second 


surface by 
(п — nı )tf 


A, = “Rim (6.40) 


Optics 233 


In air 
n—1 tf 
A, — -- n В, * (6.41) 
and 
_ n-—1 tf | 
А, =— Т R, (6.42) 


The typical locations of the principal planes for the six simple lenses are 
shown in Figure 6.17. 

Two thin lenses separated by a distance d (or two simple lenses whose 
principal planes are separated by d) have an effective focal length f; 2 given 
by: 


| 1,1 4 
Lf ЫЕ: (6.43) 


The principal planes аге located at 


A, =-=, (6.44) 


and 


А, =. (6.45) 


The matrix for a two-lens telescope is: 


ЕЕ 2 [fi fi + f, 
M = . (6.46) 
0 ғ, /f, 


'$esua| ejduuis 9 10j seuejd jediouug /|'9 әлпб! 4 
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Ch — = “у 
= у 3-- = Ту 
— © 
09 = 8 001 = 8 
09 = !H 
| | | 
—. - _ l 
| Q2 dv 09— = *g 
у El = 'v 
0g = © Е = ty —7 
09 = "4 
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6.5 Matrix Representation of Plane Mirrors 


The action of a plane mirror when the direction cosines of the normal 
pointing into the mirror are (L, M, N) is described by 


0, 1—21? -2ІМ —2LN Q, 
m, | = | —2LM ] —2M^ | —2MN m, (6.47) 
Ny —2LN -2ММ l- 2N? ni 


As an example, consider the case of a mirror whose centroid is the y axis and 
which is rotated about its centroid parallel to y, normal to z, and with x = 0, 
as shown in Figure 6.18. Then 


= cos Ө, 
М = 0, 
and N = cos (7/2 — 0) = sin 0. 


The direction cosines after reflection are 


Q l —2cos?0 0  -—2sin0 cos V / £ 
m] =] 0 1 O mg 
n —2sin@cos@é 0 1 —sin?0 No 
or 
g = (1—2cos?0)£, — 2(sin 0 cos 0)ng, 
m - mg 
and n = (2sin@ cos 0)£, + (1 — 2 sin?0)ng. 


Because the mirror couples the direction cosines, it is necessary to treat 
the system using coordinate and cosine pairs up to the point of a refraction, 
to treat the refraction using all the cosines, and then to resume using the 
simpler notation. This gets so involved, however, that if the aim is a general 
understanding of the system operation it is much simpler to do a graphical 
raytrace. For the same reason it is usually clearer to perform a geometrical 
analysis of a mirror system if one wishes to derive equations for the system. 
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Figure 6.18 Reflection by a plane mirror. 


6.6 A Detector Lens for the Example System 


We will consider only refractive optics here because their use provides 
greater design flexibility and because they are more frequently used than 
reflectors. Spherical and flat surfaces are used almost exclusively because 
they are easier and less expensive to fabricate than aspherics. Aspherics tend 
to be used only in optics such as zoom systems where the use of spherically- 
surfaced-refractors would result in excessive complexity and transmission loss. 

An ideal lens converts a converging or diverging spherical wave into a 
converging or diverging spherical wave, but all real lenses produce wavefront 
aberrations which cause deviations from the ideal. The primary wavefront 
aberrations are focusing errors, the Seidel aberrations (spherical aberrations, 
astigmatism, field curvature, distortion and coma), and chromatic aberra- 
tions. Detailed discussions of these and higher order aberrations and of 
designing for minimum aberration are treated extensively in many texts on 
optics, and in the context of infrared systems by the standard infrared texts. 
Attention here is confined to consideration of a few simple concepts. 

The example system requires a lens having an aperture of 2 cm and a 
focal length of 5 cm. The first choice necessary is the selection of the type of 
lens to be used. For minimum complexity, it is obviously desirable to make 
the lens of a single element. We will discover some conditions for which this 
is possible. 
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6.6.1 Effects of Spherical Aberration and of Coma 


The type of single element lens which is allowable is determined pri- 
marily by the aberrations which are tolerable. First, consider longitudinal 
spherical aberration. The deviations of the wavefront from a perfect spherical 
converging wavefront emanating from the exit pupil are given by the aberra- 
tion function W. The aberration function produced by spherical aberration 
at a radius r in the exit pupil is expressable! as: 


үү = r^ o Сао (6.48) 


where 9 C49 is the spherical aberration coefficient. The maximum value of 
spherical aberration occurs at the pupil edge for r = rọ. For a thin lens of 
focal length f and index n which produces an image at an image distance i, 


| п+2 
Сл, = ————— 2 + А(п + 1 
0-40 3 а = 1) 1-14 ( )ра 
+ (Зп + 2) (n – Dp? +a], (6.49) 
where 
Е К, +R, (6.50 
d. R, — R, ? M ) 
and 
р=1 2 (6.51) 


If we use germanium for the lens, the mean wavelength іп the 8 їо 12 
band is n = 4.004 and 


о Сао = — 28 [0.0208q? + 0.200 pq + 0.4209p? + 0.2137]. (6.52) 
We are interested in an infinity focus for which i 7 f so that 


р= —1 апд Са, = -5 [0.020892 — 0.24 + 0.6346]. 
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The aberration coefficient is approximately minimized when 
_ _ 1 
а = 5 and ¿C40 = ET (0.155). (6.53) 


Now note that the wavefront deviation W = г) о Сао becomes W = —0.155 D,/ 
(64 (F/#)°). For our detector lens, О, = 2em, F/#=2.5,and W=3.1 X 107^ cm. 

This is a negligible amount of aberration (less than one-half wave). If it 
had been significant, the spherical aberration could have been partially com- 
pensated by a linear focal shift in the direction opposite to the spherical 
aberration. Obviously longitudinal spherical aberration cannot be totally 
eliminated in a distantly-focused thin lens of germanium because the aberra- 
tion function never goes to zero in the long wavelength band. The value 
q = 5 implies R, = 1.5 R,. 

Next consider coma. According to Born and Wolf?, the coma of a thin 
lens is eliminated when the following equations are satisfied: 


n? 


n? — 1 


А .2n*ll, 
К, п+10 


L , (6.54) 


and 


n?—n-1 


1 1 
R, п+10 n? — ] f^" 


(6.55) 


where O is the object distance. For О = œ and n = 4.004, 


2 2 
R, == г= 0.938 f and В, = 2 —*— 
n n —n-— 


f= 1.363f. 


Thus 


R, = 1.454R,, 


which fortuitously is approximately the same solution as the radii which 
minimize spherical aberration. 
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If the example system consisted only of a detector lens and a scanner, 
the detector lens shown in Figure 6.19 satisfying the above equations would 
suffice. Here a center thickness of 0.50 cm has been allowed for structural 
rigidity. However, if auxiliary optics such as magnifying telescopes are to be 
used, this design may not suffice because the best tradeoff between aberra- 
tion balance and overall complexity may dictate a two-element detector lens. 


CENTER 
THICKNESS = 0.50 cm 


R, = 8.501 ст 


f= 5cm 


Figure 6.19 An example detector lens with minimum coma 
and spherical aberration. 


6.6.2 E ffect of Chromatic Aberration 


The longitudinal chromatic aberration, or the difference in focus be- 
tween the two extreme wavelengths in the bandpass, is shown in Figure 6.20. 
It is found by calculating the difference between the focal lengths for the two 
wavelengths: 


Af = f(n = 12 um) — f(n = 8 um). 


First, define 


|. R4 Ra 
KER +R,’ 
_n(8) + n(12) 
п», 


апа 
An = n(8) — n(12). 
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Af = .005 cm 
— - y 5.667 cm | | 


f=5 cm 


Figure 6.20 Longitudinal chromatic aberration of the 
example detector lens. 


Then the separation of the two foci is: 


(6.56) 


where f is defined for the mean index n. A dispersion index V (also called the 
Abbe number) is defined by 


угла, (6.57) 


so that 


Af = (6.58) 


«IT 
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The shift on either side of a best focus is 


Ai= ү (6.59) 


For the example detector lens of germanium, 


_ _4.0038—1  . 
У (8 >12 um) = 70053 40933 = 1001.27, 


Af = (5 cm) 


~ -3 
= 100127 => X107 ст 


апа 


^i at = 2.5 X 1073 cm. 


This amount of linear defocus produces a dimensionless quality parameter 
A (see Section 6.6.4) of 


A=2 sin (PAi 


= 0.197, (6.60) 


which produces negligible MTF loss. 

Had this chromatic aberration not been acceptable, a technique of com- 
bining two or more separationless thin lenses to eliminate chromatic aberra- 
tion would have to be used. Hertzberger and Salzberg* showed that materials 
are available in the thermal infrared which permit 2 lenses to match focus 
for 2, and possibly 3 wavelengths. In theory, 5 matched wavelengths imply 
perfect color correction, and in practice four wavelengths suffice. Such a 
perfectly achromatized lens assembly is called a superachromat. Some 
examples of achromats are shown in Figure 6.21. 


6.6.3 Effect of Field Curvature 


The surface of best focus for an optical system may have a variety of 
three-dimensional shapes. When the optical system is completely free of 
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Figure 6.21 Converging beam scan optics. 
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aberrations, the curved image plane remaining is called the Petzval surface. 
The Petzval radius Rp of this field curvature is given by 


[op 
Rp |x ы] , (6.61) 


1 


where the n; and the f; are the indices and focal lengths of the individual 
lenses in the system. Thus an unaberrated lens produces a focal surface 
which is a section of a spherical shell, as shown in Figure 6.22. Obviously a 
vertical detector array contained in a plane and having its center at the 
optical axis will be increasingly out of focus as the array extremes are 
approached. This is shown geometrically in Figure 6.23. 


LENS 


Figure 6.22 Petzval curvature. 


The distance x from the lens to the Petzval surface as a function of the 
field angle 0 may be derived as follows. By construction, 


Rp’ =(Rp – fy +x? + 2x (Ер — f) cos 0. 
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Figure 6.23 A section of the Petzval surface in the plane of the paper. 


Solving for x we find 


x =V Rp? — sin? 0 (Ry — fy — (Rp — f) cos 0. (6.62) 


The defocus e(0) from a flat detector plane is given by 


e(0) = — x(0). (6.63) 


cos 0 


The geometrical blur diameter w is shown in Figure 6.24, and is given by 


This problem is exaggerated in a parallel scan system where an extended 
detector array is necessary for simultaneous total vertical field coverage by 
all detectors. A serial scanner having a short array and operating with a 
parallel beam scanner does not have this difficulty. Many solutions to the 
problem exist, among them designing the optical system to have a flat focal 
surface, fabricating the detectors on a continuously curving focal plane 
assembly, or constructing the array of flat canted segments as shown in 
Figure 6.25. 
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Figure 6.24 Blur due to focus error. 
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Figure 6.25 Segmented detector array. 
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For our example system with its 5 cm focal length detector lens con- 
structed of germanium of index 4.004, the radius of field curvature is 
20.02 cm. If the example system is implemented as a parallel scanner with 
150 elements 0.005 cm high separated by 0.005 cm, the array height will be 
1.495 cm. The field of view half-angle A/2 subtended by half of the array 
including interlace is: 


a= arctan (т. = 8.531". 
For 0 = А/2, 
x = У (20.02)2 — (20.02 — 5)? sin? (8.531?) — (20.02 — 5) cos 8.531? 
= 5.046 cm. 


The focus error at the array edge will be: 


_ __5 ст _ 
€ = Cos (8.5319) — 5.046 = 0.01 cm. 


The geometrical blur resulting is 


– ___(2 ст) (01 ст) ____ ап 
G2 X3 cm) (5 em + 0.01 ст) 0.000399 radian ~ 0.4 mrad. 
Thus for the example system edge defocus is not a problem. 
One optical solution to the problem of field curvature consists of rede- 
signing the detector lens as a doublet having zero curvature such that two 
conditions are satisfied: 


fi = hf (6.64) 
2 f, +f, | 
and 
Ll io apo, (6.65) 
nif, afa ‚о? nz p | 
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Thus the requirements are 


p= 2o e 
1 ni 12» 


and 


р-н, 
2 n, 12, 
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(6.66) 


(6.67) 


and а flat field with a doublet requires that two different lens materials be 
used. The other optical solution to field curvature is the addition of a field 


flattening lens near the image plane. 


6.6.4 Effect of Defocus in a Diffraction-Limited Optic 


A very practical problem concerns the loss of image quality due to a 
defect of focus such as could arise from thermal effects in the optic or by 
human error in mechanical focusing. Consider the simple case shown in 
Figure 6.26, where a lens of diameter D and focal length f images an infinitely 


distant object. 


BLUR DIAMETER d 


Figure 6.26 Blur due to defocus. 
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The construction of Figure 6.26 shows the effect upon an optic focused 
on a point at infinity of an error in the placement of the focal plane. Two 
cases, error Ai, to the left of, and an error Ді, to the right of the focal plane, 
are shown which result in the same geometric blur diameter d. By similar 
triangles 


d/2 D/2 рді, | ОА, 

Ai f- A О  f—A f - (6.68) 
and 

4/2 _ D/2 Е DAi, _ DA, | 

Ai, f+ Ai, or d ТФА, f (6.69) 


Note that a specific defocus on either side of the focal plane produces 
approximately the same geometrical blur, d = DAi/f. The corresponding 
angular geometric blur diameters 6 are obtained by dividing d by f, 


_ DA; 
6-— (6.70) 


Hopkins? derived an expression for the OTF as a function of the focus 
error Ai, the F/#, and the wavelength, for the case of a diffraction-limited 
circular aperture illuminated by a monochromatic plane wave. Levi and 
Austing? simplified Hopkins’ equation to 


| 1] 
MTF (»,, A) al (1 — x2)!” cos [a(x — »,)] dx (6.71) 
d 
where 
а = 2п0 А (6.72) 
__ „(р1,. 
А = 2 sin (2) ДИХ (6.73) 


Ai = linear defocus in the same units as А 


v, = relative spatial frequency in dimensionless units of 
f/f., 0 & », & 1l, f, = D/A. 
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They calculated and tabulated values of MTF (»,, A) for values of у; 
from 0 to 50. Figure 6.27 graphs some of those results. Geometrical optics 
predicts an MTF degradation of Bessinc (df), but according to Hopkins, this 
prediction is accurate only for frequencies up to about f/f, = .1, regardless 
of the focus error. At higher frequencies, the geometrical optics prediction 
overestimates MTF, regardless of focus error. 

To demonstrate the use of these analyses, consider the example system 
used without a telescope. The detector lens focal length is 5 cm, the aperture 
is 2 cm in diameter, and the detector element size is 0.005 cm by 0.005 ст. 
If the detector lens is diffraction-limited, its MTF cutoff frequency f, fora 
perfect focus is 


Па 2 cm 
À 


f = = 
1072 cm 


C = 2 cy/mrad. 


The detector angular subtense is 1 mrad, giving an MTF first-zero frequency 
of 1 cy/mrad. The resulting MTF’s and their product are shown in Figure 6.28. 

Let us assume for the sake of argument that a defocus which causes the 
product тога to go to zero near 0.5 cy/mrad is severe enough to be objection- 
able. Referring to Figure 6.27, we see that A = 2.5 is sufficient to cause this 
degradation. For our example system, 


. 2 cm Ai 
2.5=2sin? (=|) —2— 
m (5 (5 a5) 10-3 ст 


ог Аі = 0.0316 cm. 


Since 
Ai =< or d «Әді. , 
d ~ 2 ст (.0316 cm) _ 0.0127 ст 
5 ст 
and 
5 ~ d.- 0.0127 cm _ 4 99953 radian = 2.53 mrad. 
f 5cm 


The tolerable defocus blur is thus about 2-1/2 mrad. 
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MTF . 


0 1 2 3 4 5 6 7 8 B 1.0 
SPATIAL FREQUENCY [cy/mrad] 


Figure 6.28 MTF's of the example system for perfect optics and detectors. 


6.7 Auxiliary Optics 


The detector lens may be augmented by the use of a magnifying 
telescope to increase the resolution. Magnification decreases the field of 
view, increases the angular resolution, and keeps the ratio of detector sub- 
tense to clear aperture diameter constant so that thermal sensitivity does not 
change. Figure 6.29 shows some possible astronomical (image inverting) tele- 
scopes. The first telescope consists of a positive single-element objective lens, 
a real image plane at which a field-of-view-defining stop is located, and a 
positive single element collimator or eyepiece lens. 

The second telescope shows a weakly negative element used to reduce 
chromatic aberrations in the objective. The material of this chromatic 
corrector must exhibit a higher dispersion than the material of the first 
element in order to correct the shift of focus with wavelength. The focal 
length of this objective is given by 


гі; 


(6.74) 


Zero longitudinal chromatic aberration in the objective requires that 


fio (1) = fi (А). (6.75) 
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DETECTOR LENS 
COLLIMATOR LENS 
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TELEPHOTO 
OBJECTIVE 


DETECTOR LENS 


Figure 6.29 Astronomical telescopes for parallel beam scanning systems. 
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Solving for d and substituting the relations 


f, (X) f; A) 


Га O7 £o) 4f Q) - d 
and 
_ k 
[7209-1 


expressed for the two wavelengths А, and A, yields 


f, f, 
d- | у Am т D -Ama,-D 
An, (n, — 1) An, (п, — 1) 


In equation 6.78, 
Ап, = п, (№) — n; (Ay), 


An, = п (№) — n; (À), 
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(6.76) 


(6.77) 


(6.78) 


(6.79) 


(6.80) 


(6.81) 


(6.82) 


(6.83) 
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The combined focal length then is expressed by 


f р, (V, + V2) 


127 f, У, + f, V, . (6.84) 


As an example, if the first lens is made of germanium, V, = 1001.27 from 
8 to 14 um, and if the second lens is of zinc selenide, V, = 34.45. Then 


d =0.9667f, + 0.03324f,. 


The parameters may be selected for the best aberration balance. The 
chromatic correction equation for a cemented doublet of thin lenses having 
focal length 


az + (6.85) 


-У, 
fi TV. fj, (6.86) 
m 
f “V, V, f, (6.87) 
and 
V-V; 
f, 2-2 f. (6.88) 


The third telescope shown in Figure 6.29 is an astronomical telescope 
with a telephoto objective. The first element has a higher power than the 
first element of the simpler astronomical telescope, followed by a negative 
second element. This produces a shorter overall length for a particular F/#. 

Figure 6.30 shows a Galilean telescope. The first element is positive, the 
second element is negative, and the separation is the sum of the focal lengths. 
There is no image plane, and for a compact design the exit pupil is inside the 
telescope and the exciting beams diverge. Therefore, Galileans are most useful 
when placed in front of the detector lens in a converging beam scanner. 
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қ >0 


d 


f«0 


SEPARATION =f, * f; 
Figure 6.30 A Galilean telescope. 


A dual field of view may be implemented in a variety of ways. Some 
typical telescopic systems are shown in Figure 6.31. Finally, Jamieson? has 
shown that workable continuous-zoom infrared optical systems can be de- 
signed using no more than two or three elements. Jamieson discusses the 
image quality of such simple zoom systems and presents some designs. 


6.8 Thermal Imaging Optical Materials 


Three classes of optical materials commonly are used in thermal im- 
aging lenses and windows: semiconductor (single crystal or polycrystalline) Si 
and Ge, hot-pressed polycrystalline II-IV compounds such as the IRTRANs;* 
chemical vapor deposited ZnSe and ZnS; and the chalcogenide glasses such 
as TI 1173.+ Silicon and germanium are used for the majority of applications 
because their high refractive indices and excellent mechanical strengths and 
hardnesses simplify optical design. 

Ideally the refractive index n of a thermal imaging lens material should 
be high, should have zero change with temperature, and zero change with 
wavelength (zero dispersion). The material should have zero coefficient of 
thermal expansion, high surface hardness, high mechanical strength, com- 
patibility with antireflection (AR) coatings, zero solubility in water, and low 
infrared absorptance. High refractive index is necessary to minimize lens 
curvatures, lens thicknesses, and the number of lens elements in a given 
F/# system. Low thermal coefficient of refractive index is necessary to 


*Trademark of the Eastman-Kodak Corporation 
1 Trademark of Texas Instruments, Incorporated. 
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(a) 


| 
FOV SWITCHING MIRROR 


MOVES IN AND OUT OF 
PLANE OF THE PAPER 


Figure 6.31 Dual field telescopes: (a) two separate objectives selected 
by a switching mirror, (b) a coaxial two-step zoom telescope. 
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prevent aberration unbalancing and focal length changes as the lens temper- 
ature changes. Low dispersion is necessary to minimize chromatic aberration 
and the number of lenses necessary to compensate for it. Zero change of lens 
dimensions with temperature is necessary to avoid aberration-unbalancing 
and focal length changes, and to simplify lens mountings. 

High surface hardness prevents scratching and abrasion, and high mech- 
anical strength allows thin lenses (high diameter-to-thickness ratio) to be 
used. Compatibility with AR coatings is absolutely essential because lens 
surface reflection losses increase with increasing n. Nonsolubility in water is 
necessary to assure lens integrity in rain and high humidity. Low radiation 
absorptance minimizes absorption losses and allows efficient system design. 

No known material possesses all of these desirable qualities, but silicon, 
germanium, zinc selenide, zinc sulfide, and TI 1173 come the closest to 
perfection. They have high indices, hardnesses, and strengths, are compatible 
with AR coatings, are insoluble, and have low absorptance. Silicon and 
germanium are the most frequently used thermal lens materials. The optically 
important properties of the useful materials are summarized in Table 6.1. 

Herzberger and Salzberg* found that the dispersion of infrared optical 
materials is accurately described by the equation 


n=A+BL+CL? + РХ + ЕМ, 


where 
1, = (№ — 0.028)". 


The coefficients they derived are given in Table 6.2. Such an equation is 
useful for interpolating between measured index values. The dispersion of 
several materials is shown in Figure 6.32. References 8 and 9 through 12 
contain the detailed discussions from which this section was abstracted. 


6.9 Thermal Effects in Optics 


Most thermal imaging refractive lens materials, including germanium, 
have refractive indices which change significantly with temperature at terres- 
trial temperatures. Many applications of thermal imaging systems are used in 
environments where temperature extremes of -20°C to +40°C are not un- 
common, and in other applications such as high altitude aircraft, high speed 
aircraft, or space vehicles, the extremes may be greater. Thus the effects on 
lens parameters due to thermal index changes cannot be ignored. 
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Figure 6.32 Refractive indices of common thermal imaging lens materials. 


In a single lens composed of one material and located in air, the lens 
focal length f is related to the refractive index n by 


Ё = , (6.89) 


where k is a geometrical constant if the lens material does not expand. The 
partial derivative of f with respect to n is 


of k 


and the partial derivative of n with respect to T is dn/dT. Thus a small focal 
length shift for a small temperature change T is given approximately by 


k on f on 


Af =- 1: D» эЭТАТ“-ттт 3T AT. (6.91) 


If the focal shift is large, it will be given by 


k k 


т у: (6.92) 
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The thermal coefficients of index of the useful materials were given in 
Table 6.1. The image of an infinitely distant object shifts by an amount Af 
toward the lens for negative дп/дТ and by Af away from the lens for positive 
дп/дТ. This must be compensated if it exceeds the desired depth of focus. 
Since all of the useful materials have significant thermal coefficients, it 
is necessary to devise a compensation scheme. There are four possible 
solutions: 
1. Provide a manual focus adjustment. 
2. Provide an automatic electromechanical focus adjustment. 
3. A-thermalize the lens by a combination of lens materials having ап 
effective дп/дТ of zero. 
4. A-thermalize the lens by using lens mountings which move passively 
to compensate the defocus. 


The detector lens for the example system at 300°K has f=5 cm, n = 4.004, 


„дп _ -6 9(-1 
3p +300 X 10 С”, 


апа 

Af = — 4.993 X 10% AT [cm]. 
For 

AT = +10°C, 

Af = — 4.933 X 10? cm. 


This produces a defocus dimensionless A (see Section 6.6.4) of 0.395, which 
is a trivial defocus. A temperature shift of +50°C, however, would produce a 
A of 1.97, which is a serious defocus which would have to be compensated 
for by a refocus mechanism. 

For small temperature shifts the thermal expansion of any lens dimen- 
sion R is given by 


ДЕ =RaAT, (6.93) 


where a is the coefficient of thermal expansion in [cm/(cm°C)]. It is trivial 
to show that the focal length of a single lens changes as 


i - of. (6.94) 
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The germanium detector lens for the example system has 
f=5cmanda=6.1 X 106 °С-!. 
For AT = +10°C, the defocus is 
ДЕ = (5 cm) (6.1 X 106 °С-!) (10°C) = 3.05 X 107 ст, 


which is an insignificant defocus. 

This example demonstrates that lens thermal expansion effects tend to 
be small compared to the effects of thermal refractive index change. However, 
lens housings also expand and contract to produce defocus. If the example de- 
tector lens were held іп an aluminum housing having а = 23 X 10°°°C"!, the 
linear defocus 6 produced by the housing expansion alone for a 10°C AT 
would be 


6 = (5 cm) (23 X 109) (10) 
= 1.15 X 10? ст. 
All of these effects are accountable by a total defocus given by 


f дп 


ô = ~n—10T 


+ af — aL} AT, (6.95) 


where L is the overall length of the lens system. The thermal expansion coef- 
ficients for some common lens housing materials are given in Table 6.3. 


TABLE 6.3 
Thermal expansion coefficients of some common lens materials 


-6 
MATERIAL APPROXIMATE ЈЕ = 
cm C 
Aluminum 23 
Magnesium 26 
Stainless Steel 11 
Titanium 9.5 
Beryllium 11.5 


Ni 42 (low expansion alloy) 5.7 


264 Chapter 6 


6.10 Passive Optical Compensation of Thermal Refractive 
Index Changes 


The focal length of a simple lens at temperature T, is defined by: 


КТ.) = (6.96) 


__к 
(То) — 1°” 


where n(T,) is the index at that temperature. Most useful thermal imaging 
optical materials exhibit significant thermal coefficients of index. For zero 
thermal coefficient of expansion, and non-zero coefficient of index, the focal 
length at T = T, + AT is given approximately by: 


k 


f(T) = hat Зао) | (6.97) 
n(To) T 
if (02 п/дТ? ) = 0 or for small AT. 
The change in focal length is: 
on(To) 
k oT 
Af = f(T) — f(T,.) = — = |—————— 
(D) о) = тоу Эп(Т„) 
n(T,) — 1 + AT “ЭТ. 
én(T,) 
= —f AT 3T ^ (6.98) 


since n-1 > AT(0n/dT) for infrared materials and for reasonable temperatures. 

Passive optical compensation requires that дЇ/дТ = 0 over some speci- 
fied temperature range. Completely successful passive compensation is proba- 
bly never realizable because the lenses of a system will not always heat up 
uniformly, so that gradients across lenses and differences from lens to lens 
will occur. 

As an example, consider the doublet shown in Figure 6.33. If the lens 
separation s is small compared to the product of the two lens focal lengths, 
the focal length of the combination is: 


fif 
p- 


EATA (6.99) 
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f, >0 f, <0 


Figure 6.33 A lens system for passive optical compensation of 
refractive index change with temperature. 


The derivative is: 


Of _ f д 9, 1 of, of, 
T^ (т кт (т әт): (6109 


опе сап readily show that: 


l дп, /9T l дп, /9T 


Thus, for perfect passive compensation, we require the bracketed term to be 
zero. Defining the relative coefficient of index I as 


_ дп/9Т 


І n—1" 


(6.102) 
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the requirement is: 
2 = 5 (6.103) 


If the lens combination is to be convergent, only three cases are allowed: 


I, 


CASE 1: 1 
1 


<0,f, >0,f, >0 


1, 


СА$Е 2: ] 
1 


>1,f, >0, f, <0, |[> If, | 


1, 


CASE 3: 0 «1 
1 


<1, f,<0,f, >0, Ifi | >If, | 


As an example, if the first element is of ZnSe and the second is of 
germanium, 


f, = —0.34 f, = .66 f, 


fj = —1.94 f. 
TABLE 6.4 

1 дп А 
MATERIAL 09-25 С,8<Х< 11, 
Germanium +100 X 106 °С 
ZnSe +34 X 1079 
ZnS Unknown 
TI 1173 +50 X 1079 
As; $3 —7.25 X 1079 


T£, Вг! ~171 X 1075 
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One disadvantage of this approach is that the chromatic aberration 
correction becomes more complicated. The longitudinal chromatic aberra- 
tion for a separationless doublet of thin lenses is derived as follows: 


_ Of of 
Af т An, + jn; An; (6.104) 
of 2 - 
Эл on E fa (fı + £2) | 

af, f, of, f, of, OfjN fif, 
“On 1) +f, an fi +f, Von Ong «ry (1°? 
of f f, of fif 
Эп за +h би Ly 6199) 
at _ 95 f; h fih- (6.107) 
дп, дп, f + f, дп» (f, + f, y | 
of | f 
= (6.108) 
and Af is found to be 

- efl ,l. 

Af=—f I. r^v z | (6.109) 


In the case cited above, V, (germanium) = 1113, V, (zinc selenide) = 83, and 
Af = —f/73. 

Therefore, the effect in this case is equivalent to that of a single lens 
using a material more dispersive than ZnSe, and chromatic correction re- 
quires use of a third element even more dispersive, such as ZnS. 


6.11 Off-Axis Image Quality 


The system specifier or designer is frequently faced with the necessity of 
deciding how much off-axis image degradation to allow relative to the per- 
formance in the center of the field. 

Section 4.11 indicates how an observer’s sensitivity to change in resolu- 
tion depends on the baseline resolution. One liminal unit of resolution change 
is typically 5 to 10 percent of the baseline resolution at low resolution, and 
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resolution change becomes harder to detect as resolution improves. Obviously 
off-axis degradations may fall near one liminal unit with little or no per- 
formance loss. Section 10.2 will show that observers searching a field tend to 
concentrate on the central 9 degrees of the display, paying less attention to 
the display extremes the larger the angular size of the display becomes. 
Clearly if image quality is to be sacrificed in some part of the image, it 
should be the least-used part. Thus resolution at the edges of large displays 
can be poorer, but not so poor that it becomes distracting or disconcerting. 
Some resolution loss at the field edges of optical systems is unavoidable due 
to distortion and lateral chromatic aberration. A factor of two to three 
in resolution is not objectionable, and is usually not noticed if one is not spe- 
cifically looking for it. This has been demonstrated conclusively in the opera- 
tion of circular-scan FLIR’s having deliberate resolution variation from the 
center to the edge of the field! ?. 


6.12 Image Defects in Fixed-Focus Systems 


An optical system which has its focus fixed for a particular object 
distance O is not necessarily in good focus for all object distances of interest. 
Consider the simple converging lens with aperture diameter D shown in 
Figure 6.34 focused at an image distance i for an object at O less than infinity. 
Figures 6.35 and 6.36 show the foci for objects at O, and O, other than O. If 
a thin detector is placed at i, it will see an object at O in perfect focus and 
objects at O, and O, impertectly focused. First-order geometrical optics 
predicts that the images at i of points O, or O, will be circles with diameters 
d, and d, respectively. The quality of the focus depends in a complicated 
way upon the blur diameter, as for example the case of the defocused 
diffraction-limited optic described in Section 6.6.4. 

It is common practice to establish a criterion of goodness relative to the 
detector dimension based on theoretical considerations or practical expe- 
rience. Whatever the criterion, it is useful to know the limits of object or 
image movement for which a particular goodness criterion is violated and to 
know this in terms of the blur diameter d or the associated angular blur 
diameter 5. Referring to Figures 6.37 and 6.38, the distance О, <O for 
which the criterion is met may be called the near depth of field, and the 
distance О, > О for which the criterion is met may be called the far depth of 
field. The corresponding shifted image plane focal points may be called the 
near depth of focus i, and the far depth of focus i, . 
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MODERATE RANGE 
OBJECT 


Figure 6.34 Focus for a distant object not at infinity. 


FAR OBJECT 


| 0, | іі | 


Figure 6.35 Focus for an object more distant than in Figure 6.34. 


NEAR OBJECT 


| О; | l2 


Figure 6.36 Focus for an object nearer than in Figure 6.35. 


270 Chapter 6 


Figure 6.37 Defocus for an object more distant than the in-focus range. 


| o | r | 


Figure 6.38 Defocus for an object closer than the in-focus range. 
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Consider first the case of Figure 6.37 where the image at i, of an object 
at O, comes to a focus in front of the detector at i. A point at O, is imaged 
as a circular blur of diameter d, . For sufficiently large F/#, 


D d 1___ l . 

i i-i Di" (4, ғың: (6.110) 
Using 

11 1,11 1 


it is easy to show that 


_ ООР ~ DOf 
О ADF di) 40 07-40 (6.111) 
for small defocus and that 
. РЕ 
14 = D 4 d, (6.112) 


Next, consider the case shown in Figure 6.38 where the image at i, of 
an object at O, imaged as a circular blur of diameter d, at i. As before, one 
can show that 


DOf _ рог 


О = SFF d, (O-N Df4djO: (6.113) 


and 


TET (6.114) 


The far depth of field О, extends to infinity when O = Df/d,. This in- 
focus object distance is called the hyperfocal distance Ор. When O = Ор, the 
near depth of field O, becomes Df/2d, , or О,/2. 
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Note that from previous equations, when the blur diameter is small 
compared to the aperture, іу 2 5 = f, so that the angular blurs associated 
with d, and d, can be written as 


d; 


бі ~ — and p =. 


f 


Then the depths of field and of focus may be rewritten as 


0, -5 3 (6.115) 
i "Di5. (6.116) 
О, = Di (6.117) 
bpm. = (6.118) 


When the object distance is infinity, 


D 
О, =7—, (6.119) 
0; 
so that the hyperfocal distance and the near depth of field for the infinity 
focus case are identical. 
Solving for the d’s and 6% gives 


d, =Df (5-%:) (6.120) 
d, = Df (о;-б) (6.121) 
5, =D (0-0; (6.122) 
5, -D(s. -б) (6.123) 
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When the object is at infinity, 


D 


5, = — 
2 0, 


(6.124) 


Of related interest is the amount the detector focus may shift due to 
mechanical wear, to shock, or to vibration, before intolerable blurring re- 
sults. Consider Figure 6.39 which shows two cases of defocus Ai. For large 


F/# 


.D.d. 
20 == д › 
and 
ді- S.B = ВР, (6.125) 


where 6 is the tolerable angular blur, 


рі -і----- 


Figure 6.39 Defocus due to shift in image plane. 


| ai | ai | 
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Finally, consider Figure 6.40, which shows the focus at an image dis- 
tance f + Ai for an object at O less than infinity. The lens law is 


| 1 _1 
O f*A f (6.126) 
Or 
i= Pp ~É 
Ai= бр =O · (6.127) 


Thus the image shifts away from the lens and the infinity focus by this 
amount as the object distance shifts inward from infinity. 


Figure 6.40 Focal plane shift as an object moves in from infinity. 


Referring back to Section 6.6.4, the tolerable defocus blur for the 
example system is about 2-1/2 mrad. If the detector lens is focused at in- 
finity, the near depth of field O, for this defocus is 


D 2 ст 


O2 =>. = 0.0024 radian 


~ 789 ст = 8 meters. 
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The hyperfocal distance for the allowable defocus is 


On => 8 meters, 


and the near depth of field for the hyperfocal focus is 4 meters. 

This example demonstrates how relatively insensitive the MTF is to 
defocus. Here a 1 mrad system with a 5 cm focal length has been reduced to 
a 2 mrad system by a geometrical defocus blur of 2-1/2 mrad with a linear 
defocus of 32 thousandths of an inch. 


6.13 Cold Reflections 


Any thermal imaging system which uses refractive optics and cooled 
detectors can exhibit an image defect resulting from internal retroreflection 
of cold surfaces onto the detectors. When the detectors sense their own cold 
surfaces relative to their warm surround, this phenomenon is called the 
narcissus effect. Narcissus arises whenever at any point in the active scan any 
part of the cold detector focal plane is reflected by an optical element so that 
the reflected image is in focus at the array and a warmer background appears 
at other points of the scan. Two common examples are narcissus from a flat 
window and from a lens surface concave toward the detector. 

As a simple example, consider the case of Figure 6.41, where a con- 
vergent meniscus lens is used with a converging beam scanner. The lower 
part of the figure shows a moving detector equivalent of the system where the 
effect of defocus due to the scanner is neglected. Although narcissus prob- 
lems in practice are not so simple, we will use this system as an example 
because it demonstrates the essential features of any narcissus problem. 
Assume that the lens focal length is f, the detector dimensions are a and b, 
the detector focal plane is at a cold temperature Те, and the focal plane 
surroundings are at a warmer temperature Twy. 

Consider the worst possible case where the lens inner radius of cur- 
vature R, equals the lens focal length f. Then if the rear lens surface has a 
reflectance r, that surface acts as a reflective lens with a focal length 
equal to R,/2. Then any point in the focal plane is reflected to another point 
in the focal plane as shown in Figure 6.42 and as may be verified by using 
the lens law. This is an unrealistic example, but it is necessary to understand 
this case before practical examples may be analyzed. 

Figure 6.41 illustrates by means of the moving-detector equivalent that 
the detector will see the warm surround of the focal plane at the scan 
extremes, and the cold focal plane itself in the center portion of the scan. 
The reflections from the lens surface therefore cause the detector to sense 
not only the scene radiation but also scan-varying radiation from the focal 
surface. 
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A CONVERGENT-BEAM MIRROR SCANNER 


DETECTOR 


COLD FOCAL PLANE 


MOVING-DETECTOR EQUIVALENT 


PARTIALLY-REFLECTING SURFACE 


Figure 6.41 A convergent beam mirror scanner 
and its moving detector equivalent. 
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Figure 6.42 Narcissus geometry. 


As the detector signals are ac coupled, the large difference signal between 
the cold focal plane and its warm surroundings will be superimposed on the 
scene video, and the display will show the scene and a dark image of the 
focal plane. This typically has the appearance shown in Figure 6.43, where 
the shape of the narcissus pattern depends on the shape of the focal plane 
cold area and the extent to which the focal plane is defocused. The magni- 
tude of this negative signal may be calculated as follows: The cold surface at 
Тос radiates as: 


С! 
W = oo. 41 
х (Te) X (ес АТС) (6.128) 
The warm surface at Тү, radiates as: 
Ci 
(6.129) 


Wy (Ty) = A5 (ес2/АТ ұу — 1) | 
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Figure 6.43 Appearance of narcissus. 


If we assume that these surfaces are Lambertian radiators, the detector 
receives "cold" spectral power Ру (Ту), 


W, (T 2 Wa (To) D? ab 
P, (Трег АС) тр? ab gi = (Mato? =) (6.130) 


Т 4R? RÀ 2 4R? 


2 


апа “warm spectral power Ру (Tw), 


W (Tw) D? ab 
Py ap (Bee (6.131) 


The difference in spectral power АР; is: 


ab D? 
К; 


AP) =r [Wy (T4) — Wp (ТОЈ. (6.132) 
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The total integrated difference power AP within a spectral bandpass from 
A, to `, assuming uniform reflectivity, is: 


2 Аҙ с Сі 
Ap-I2bD. _____-________________| dX. (6.133) 
4R; А5 (e02/ATw 1) А5 (ес Те |) 


1 


To determine the objectionability of this signal, it must be compared to 
the difference power received by the detector from typical scenes. With 
reference to Figure 6.44 showing a radiating object subtending the detector 
solid angle, the change in received power with respect to scene temperature 
T, change is: 


Аҙ 
ӘР; OW, (То) TDo? 
— s | А8 __°_ p2 
9T, = 06 || aT. dÀ АЕ? R^ 79 
1 
Аҙ 
dW (Ts) 
=4 [arta Do? To of. (6.134) 


Figure 6.44 Geometry for radiation calculation. 


For a small scene temperature difference АТ, the change in received 
power is: 


Аҙ 

УМ, (T) 

Ав ay. (6.135) 
ME 


1 


AP, = AT, авто Р 
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Finally, we can take the ratio of AP to АР,, 
Ay 
! с c 
AP г (ab/R3)D?_ ЈА, | ad (ес ГА Tw – 1) (ес2 /А Tc — 1) 
ШІ L——— M => и 


AP, AT, a то Dà 2 IW (T) 
LA S an 
| oT (6.136) 
1 


For the simple single lens case, RÀ = 12, and ab/RÀ = ap, and also 
D = Dy. To make a sample calculation assume that Ту = T, = 300°K, that 
Т, =77°K, and that the spectral bandpass is 8 to 14 microns. Then the 
second integral in the numerator becomes negligible with respect to the 
first integral, and 


14 
C1 
[=== an 
5 (202 [МТС _ 
Ap. r DREAM | (6.137) 
АР; AT, То 
д\/ СТ.) 
ал 
8 0T, 


The integral іп the numerator has the value 1.72 X 107? watt/cm? and 
the denominator integral is 2.62 X 10% watt/cm?. Thus for unity lens 
transmission, 


If the NETD and MDTD of the system are such that broad targets having 
AT, of 0.01°C or better will be detected, and if the lowest possible reflectivity 
using an antireflection coating is 0.01, then the cold reflection will exceed 
the NETD by a factor of 66. 

The above calculations represent the worst case where a lens surface is 
confocal with the focal plane. In practice, however, the lens surfaces are 
usually such that retroreflections are defocused. In general, a confocal surface 
in a single detector lens will produce 


AP _ 66 e j r 
AP, RD,/ т. AT, 


where R is the radius of curvature and D is the diameter of the confocal lens 
surface. 
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For the case of the example detector lens, К = К, = 8.501, D = Do= 2 cm, 
and f = 5 cm. Then 


АР бе | (5) Q) | г 
АР, 8.501 (2) т. АТ, 


_ 23r 
то AT, 


If r = 0.01 апат, =1, 


which is an unacceptable level of narcissus. 

Since the cold signal is negative relative to the average scene value, and 
since the average scene value is usually imaged on the display at a low lumi- 
nance value, the cold signal will usually drive the display to black. Practical 
cases where narcissus occurs may involve reflections from an entrance window, 
from parts of the sensor housing, from the scanner itself, and from lens 
surfaces anywhere in the system. The essential features of the derivation 
given above will usually be adequate to analyze any problem if the proper 
geometrical reflection factors and intervening optical transmissions are used. 

Five procedures may be used to reduce cold reflections to an acceptable 
level. These are: 


l. Reduce the focal plane effective radiating cold area by warm 
baffling. 

2. Reduce lens surface reflections by using high efficiency antireflec- 
tion coatings. 

3.  Defocus the cold return by designing the optical system so that по 
confocal surfaces are present. 

4. Cant all flat windows. 

5. When all else fails, null out the cold reflection with a thermal 
source or by electronic video signal compensation. 


Combination of the first four techniques will usually suffice to eliminate 
narcissus. 
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CHAPTER SEVEN — SCANNING MECHANISMS 


7.1 Introduction 


The function of a scanner in a FLIR is to move the image formed by the 
optical system in the plane of the detector array in such a way that the 
detectors dissect the image sequentially and completely. As shown in Figure 
7.1, there are two basic types of scanners: parallel beam and converging 
beam scanners. The parallel beam scanner consists of an optical angle- 
changing device such as a moving mirror placed in front of the final image 
forming lens. The converging beam scanner consists of a moving mirror or 
other scanning device placed between the final lens and the image. 

There are seven commonly-used optical scanning mechanisms: the 
oscillating mirror, the rotating polygonal mirror, the rotating refractive prism, 
the rotating wedge, the revolving lens, the rotating sensor, and the rotating 
V-mirror. One-dimensional and two-dimensional scanners may be implemented 
by various combinations of the basic scanning mechanisms discussed in the 
following sections. 


7.2 Oscillating Plane Mirror 


The oscillating mirror shown in Figure 7.2 oscillates periodically between 
two stops, and may be used as either a parallel beam or a convergent beam 
scanner. The oscillating mirror produces a ray angular deviation of twice the 
mirror angle change, as shown in Figure 7.2. This property may be verified 
by using the law of reflection as follows. Consider Figure 7.3, where two 
mirror positions M, and M, separated by an angle y are shown together with 
the respective mirror normals Мі and N,. Incident rays I, and І, are shown, 
both of which exit as E. The angle between I, and L,, 4(1,,I,) as a function 
of y is 


A(I; 1) = X(E,I; ) — ЕД, ). (7.1) 
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Figure 7.1 A parallel beam and a convergent beam scanner. 
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Figure 7.2 Angle doubling by an oscillating mirror. 


Figure 7.3 Geometry for angle doubling derivation. 


285 


286 Chapter 7 


From the law of refraction, 

X(I4,12) = 2 4(E,N,) — 2 X(B,N,). (7.2) 
Since the mirror normals change by 7, 

X(I,,L,) = 2[4(E,N, ) — СКЕМ, ) — 7)] = 27. (7.3) 


7.2.1 Converging Beam Scanner 


The oscillating mirror convergent beam scanner is such a common 
mechanism that it is worthwhile determining in detail how the mirror angular 
motion produces scanning and affects focussing. Consider Figure 7.4, which 
for simplicity shows only central rays for two mirror positions. Figure 7.5 
shows the same deviated and undeviated rays reflected about the deviated 
mirror position, and shows the ray paths which would result if an equivalent 
in-line scan mechanism were used. This construction simplifies the derivation. 
Let the scan angle measured from the optical axis be 0; let the mirror dis- 
placement from an arbitrary rest position be y; let the mirror pivot point be 
P; and let the detector point be D. Dimensions “а” and “b” are defined as 
shown. 


UNDISPLACED MIRROR 
a MIRROR 


ANGULAR 
DISPLACEMENT 


DETECTOR 


DISPLACED MIRROR 


Figure 7.4 Oscillating scan mirror principle. 
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N DISPLACED 


UNDISPLACED 


Figure 7.5 Path length change. 
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The scan angle @ asa function of mirror angular displacement y may be 
found from the construction of Figure 7.5. Here the triangle PDQ is 
reflected through the dotted displaced mirror plane to form triangle PD'Q. 
The total ray path length in the displaced position is c, and 


c<atb. 
It is evident from the construction that 
А-0-ҒЕ (7.4) 


Since geometric reflection of path PD about the mirror causes ап 
angular shift of double the mirror angle change y, we have also that 


= 2+ (7.5) 
so that 
+ 
y -2 | (7.6) 


Now consider the triangle formed by a, b, and c, and extend a perpendicular 
h to side c from P. Clearly 


sin £ -2, (7.7) 


sin 0 PE (7.8) 


and therefore, 


Е = arcsin (2 sin 2) (7.9) 


Substitution of equation 7.9 into equation 7.6 yields 


0 + arcsin (2 T 2) 


yer? (7.10) 


Scanning Mechanisms 289 


Manipulation of equation 7.10 yields 


_ bsin2 
tan 0 = р cos2y’ (7.11) 
or 
- b sin 27 
0 = arctan p T bcos2 =|. (7.12) 
This is the desired relation. For sufficiently small 0, this reduces to 
.2by 
0 Lb (7.13) 
or 
0 (а+Ъ 
y= заты (7.14) 


These expressions may also be derived from the construction of 
Figure 7.6. From that figure, 


– y 

tan 0 iT (7.15) 

y = b sin 27, (7.16) 
and 

z = b cos 27. (7.17) 
Then 

tang - 52, (7.18) 

bt cos 27 

or 

0 = arctan _ sin 2y _ . (7.19) 


a 
b + cos 2y 
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Figure 7.6 Path length construction. 


Referring to Figure 7.6, it is evident that the path length c from the 
lens to the detector is shortened as the mirror causes the chief ray to scan off- 
axis, OT 


c<atb=f (7.20) 
for 0 #0. A perfect lens produces an image plane which is spherical with a 


radius R given by equation 6.61. The distance of this surface from the lens 
is given by equation 6.62, 


x(0) = МЕ? — sin? 0 (R — f? — (R—f) cos 0. (7.21) 
Figure 7.7 shows that if the field is not curved opposite to the curvature 
introduced by the changing path length, a defocus will result. The defocus as 
a function of scan angle 0 is 

defocus = c (0) — x (0). (7.22) 


Thus zero defocus requires 


c(0)7x(0). (7.23) 
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The path length c may be found from Figure 7.6 by noting that 


c? =a? + b? — 2ab cos (л — 27), 


FIELD CURVATURE 


WITH SCAN 


FIELD CURVATURE 
FOR ZERO SCAN 


Figure 7.7 Defocus due to path length change. 


and substituting for y to get 


1/2 
с = fat + ђе — 2ab cos [m — 6 — arcsin (asin 6) || | 


The construction of Figure 7.6 gives 


ОГ 


C= 


y _ bsin2y 
sin 0 sin 0 


b sin [ + arcsin (t 


sin 0 


a 
b 


sin б) | 
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(7.24) 


(7.25) 


(7.26) 


(7.27) 
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Finally, the relation 


_atz 
с- cos 8 (7.28) 


gives yet another equivalent form, 


a + bcos [ + arcsin (2 ЕН d] 


cos 0 


c= (7.29) 


The construction of Figure 7.8 allows the defocus equation for a flat 
focal surface to be derived. Here we deal with an axial ray, and consider the 
deviation of that ray from an ideal flat focal surface. The segment “а” is 
common to both deviated and undeviated ray paths, and since a mirror 
deviation y causes a ray deviation of 2 y, 


= р - b. - ~ 2 
e-b'—b T 2 — b = b[sec(2y) — 1] 22b y (7.30) 
for small y, or 
2 2 
c ба + by (7.31) 


2b 


The resulting blur diameter d is found by similar triangles from Figure 7.9 to 
be 


_eD_(f-o)D 
d F ғ , (7.32) 


and the corresponding angular dimension ô is 


_d_D(f-c) 
5 и == (7.33) 


This defocus can be compensated by designing the focal surface to be 
curved horizontally to compensate for the scan defocus and to be flat ver- 
tically to allow a flat detector array. Field lenses in the converging beam 
near the detector have been used for this purpose. An alternate focus correc- 
tion scheme mounts the mirror axis on a cam which moves the mirror in and 
out to prevent defocussing. 
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Figure 7.8 Construction of the linear defocus ina 
convergent beam scanner. 
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Figure 7.9 Construction of defocus blur diameter in a 
convergent beam scanner. 
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7.2.2 Parallel Beam Scanner 


The most common use of the oscillating mirror is as a parallel beam 
scanner located between a telescope and detector lens, as shown in Figure 7.10. 
The important feature of this type of system is that the mirror is not scanning 
in object space but near the exit pupil of a telescope. The beams corresponding 
to specific field angles coincide only in one plane, the exit pupil. Thus if the 
detector lens is to be kept small, the scan mirror must be located near the 
exit pupil to keep the spread of the scanned beams small. Such a configura- 
tion is shown in Figure 7.11. Figure 7.12 shows the scan geometry in more 
detail for a non-moving exit beam. The cases shown are those where the 
initial and final mirror positions correspond to the extremes of the field of 
view of the telescope. 

The required mirror dimension 0 is determined from Figure 7.12 as 
follows. To a first order approximation, the beam diameter D for a particular 
field half angle A/2 is related to the exit pupil diameter P by 


D-P cos). (7.34) 


Summing angles around the beam axis gives 


A+EtK+y+yem (7.35) 
OT 
-ТА К 
1=4 F T (7.36) 


As shown in Figure 7:12, the mirror length 2 is related to the mirror dis- 
placement y and the beam diameter D by 


_ р 
* 7 sn 3' (7.37) 
so that 
$2 ———À— ——. (7.38) 
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Figure 7.10 A magnified parallel beam oscillating mirror scanner. 
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Figure 7.11 Mirror placement for migrationless beam. 
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Figure 7.12 Mirror length construction 


This scanner does not scan a parallel beam if the object is not at 


infinity, as shown in Figure 7.13. When the mirror is displaced by o/2, the 
axial ray shifts by o and the linear shift in object space is 


s = Bo. (7.39) 
The angular subtense o of s at a distance O measured from the lens is 
-S _ Ва _ (= ) - A 
о= i-e 55) =o ( - 9) (7.40) 


Thus the angle scanned in object space changes as a function of the in-focus 
range. The detector angular subtense also changes from 


(7.41) 


Similarly, the motion in the image plane varies with in-focus range, as follows. 
Figure 7.14 shows how the image of a point closer than infinity moves as the 
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mirror is displaced. Reflecting the two ray paths about the displaced mirror 
position produces the triangle shown in the bottom of the figure. The angle 
0 by which the central ray shifts is derived as follows: 


sin & -7 (7.42) 
sin @ == (7.43) 
Е = arcsin RS (7.44) 
у = P (see section 7.2.1) (7.45) 
~ ga) for small 0, (7.46) 
or 
ө- 292. (7.47) 


opn — 2by ._ 2bY . 
х = 0б1=-——ъ Б (f + Ai), (7.48) 
where 
f2 
Ai = 5 (see equation 6.127). (7.49) 


Using the lens law, 


Dod 
( a+b THAT (7.50) 
or 
_ Al (7.51) 


atb  f(f+ Ai)’ 
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and 

b= ( f2 + А-а) Аі. (7.52) 
Then 

2 . 
2, (FAA) Ai (rai) 
x= Kf + 8) 
(7.53) 
= 2ү (с + a= a) Аа). 
The rate of image motion in the image plane is 
dy | Ai(f — 2, 
Sx =2 at f и и (7.54) 


The term 2f dy/dt is the rate for an infinitely distant object and (2 Ai 
(f—a)/f)dy/dt is the change in rate as the object moves in to a distance 
= f?/Ai. The percent error is (Ai(f—a)/f?) X 100% = ((f—a/O) X 100%. 


7.3 Rotating Reflective Drum Mirrors 


Oscillating mirrors are unsuited for high speed scanner operation because 
they tend to become unstable near the field edges and require high motor 
drive power. In such cases, rotating drum mirror assemblies such as that 
shown in Figure 7.15 are frequently used because their motion is continuous 
and stable. Consider the polygonal drum mirror shown in Figure 7.16, where 
the mirrored surfaces are circumscribed by a circle of radius то on the outside 
and are tangent to a circle of radius r; on the inside. This particular mirror 
is ten-faceted. The facet angle Өү measured from the drum axis is 


AT 


0g = n facets | (7.55) 


If we let the facet length be 0, it is evident from Figure 7.17 that 


R= 2r, sin (02/2) (7.56) 
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Figure 7.15 Rotating polygonal mirror scanner. 
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The ratios of the facet length to the outer radius and of the outer radius to 
the inner radius are tabulated in Table 7.1 versus number of facets. 

Now consider Figure 7.18, where two drum angular positionsare shown 
with an angular rotational displacement 0p. The geometrical construction 
shown demonstrates that the final displacement 6 of the center of the facet 
in the direction normal to the initial undisplaced mirror surface is given by: 


ô = rj (1 — cos Op) 
= го [1 — sin (0¢f/2) tan (6¢/4)] [1—cos Өр]. (7.58) 
Thus the scanning action of a rotating reflective polygon is similar to the 
action of an oscillating mirror with the exception that the center of a facet 
moves in and out at a rate 
dô . .. cin (Go 6 
dr ^ fi sin (бр Одр (7.59) 


assuming zero displacement at t = 1). 
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Figure 7.17 Facet length construction. 
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Table 7.1 
Characteristics of a Polygonal Mirror 


Number of Facet E fo. 
Facets Angle Го ri 
3 120° 1.732 2.000 
4 90° 1.414 1.414 
5 72° 1.176 1.236 
6 60° 1.000 1.155 
7 51.43° 0.868 1.110 
8 45° 0.765 1.082 
9 40° 0.684 1.064 
10 36° 0.618 1.051 
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Figure 7.18 Mirror motion with rotation. 
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If such a scanner were used in a converging beam, this oscillatory 
motion would change the focal point with time and could cause severe 
defocusing. Therefore these scanners are used primarily in parallel beam 
systems. When the angle each mirror makes with the vertical is the same, a 
non-interlaced rectilinear scan results. If the angles are incremented, interlaced 
or overscanned patterns can be implemented. For example, the reflective cube 
scanner can produce interlace factors of one, two, or four. Interlace is accom- 
panied, however, by an increasing image rotation as the face edges are 
approached due to the vertical mirror angling. 

Levi! has shown that the minimum outer radius г, necessary to keep 
the beam unvignetted during the scan of a parallel-beam is given by: 


-—— 2 (7.60) 


r , 
У 2 ОНДЫ - 1) 
2 2 
where D is the beam diameter, 6 is the beam deviation at the midpoint of the 


scan, 2y is the beam deflection angle produced by the scan, 0; is the facet 
angle equalling 27/N. For ô = 90°, 


-— bD _ 
| 6p — v) 
isin (255 


The polygonal scanner is not as effective at reducing beam shift as is the 
oscillating mirror because the facet motion is not exactly equivalent to that 
of an oscillating mirror, as is evident from Figure 7.19. Two different initial 
scan positions are shown in Figure 7.20 to demonstrate the approximate 
differences in beam shift which result. 

Gorog, et al.? assert that the maximum rotation rate Mmax [rps] ofa 
polygonal mirror is limited to the rate at which the mirror will fly apart, and 
is given by 


221 [ 8T. 
Mmax Ser. УВ m) (7.62) 


The mirror material parameters are: 


R (7.61) 


p = density 
T = ultimate tensile strength 
п =  Poisson's ratio. 


Actually the mirror would become useless due to facet deformation long 
before mirror disintegration. 
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Figure 7.19 An example of facet rotation. 


CASE 1 (SHOWN): 


MIRROR FACET CENTER AT 
PUPIL FOR UPPER BEAM. 


THEN THE FACET INTERSECTION 
DIRECTION WITH THE BEAM MOVES TOWARD 
OF ROTATION THE BEAM AND THE EXIT BEAM 


fe SHIFTS TO THE LEFT SLIGHTLY. 
Pd 
Pd 
CASE 2 (NOT SHOWN): 
MIRROR FACET CENTER AT 
> PUPIL FOR CENTER BEAM. 
Pd 


THEN MIRROR INTERSECTION 
MOVES IN AND OUT AND EXIT 
BEAM MOVES LEFT AND RIGHT. 


Figure 7.20 Beam shift. 
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Another scanner somewhat similar to the polygonal mirror scanner is 
the circular-segment-raster scanner shown in Figure 7.21 and referred to as 
the “soupbowl’’. This has been used to generate a stable scan in a converging 
beam. Another mirror arrangement is the “‘carousel’’ scanner shown in Figure 
7.22 which gives a rectangular raster. 


7.4 Rotating Refractive Prisms 


The third scanner type is the rotating refractive prism having 2(n+2) 
faces as shown in Figure 7.23 and which is rotated about its centroid. 
Figure 7.24 depicts the convergent beam configuration, which has the advan- 
tage that it allows an in-line system design. It produces the scan by laterally 
displacing the converging beam, sweeping it across the detector array. The 
principle of operation is shown for deviated and undeviated rays. First, it is 
evident that if the incident and exitant beams are in the same medium, the 
angle both beams make with the horizontal will be the same. This occurs 
because the refraction at the first surface is reversed by that at the second 
surface, giving zero angular shift. However, the focus is displaced vertically 
by y and horizontally by z. 

The ray deviation y is derived from the construction of Figure 7.25 as 
follows, where ¢, is the incident ray angle measured from the horizontal, 


a+b 


г = tan (y, - М) (7.63) 

а+Ъ = ttan (рі - М) (7.64) 

+= tan (o, — М) (7.65) 

b= t [tan (v, — №) — tan (e, — N)] (7.66) 

I = cos y (7.67) 

y= t cos y [tan (p, — N) — tan (p, — №]. (7.68) 
Since М = y, 


у = t cos y [tan (p, — y) — tan (e; — y) (7.69) 
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Figure 7.21 Rotating reflective ‘‘soupbowl” scanners. 
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Figure 7.22 Rotating reflective carousel scanner. 
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Figure 7.23 Rotating refractive cube scanner. 
sin (p2 — y) n, 
n wo Y m, 7.7 
sin (фу — ү) n, (7.70) 
SO 
nj . 
n, Jn (Yi — 7) 
tan ү) = — (7.71 
2 -V= "cos (py — y) ) 
апа 
Пу. 
p; 3m (жі — У) 
tan (о, — y) = === 
Vl- sin? (уҙ — 7) 
(7.72) 


ny 
_ т, ©“! — y) 
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JE os, Sin – 7) 
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Figure 7.24 Focal shift and image displacement. 
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Figure 7.25 Image displacement construction. 
Then 
Dni , ( 
— sin — 
оо, Ма-0 1 !'__ (13) 
d Y |соѕ (р; — 7) n, 2 | 
q! — | sin (y -») 
In air, n, = 1 and 
sin (o, — 7) sin (oj — y) 
-ftcosy|——7——— OT ———— |. 
y Y| cos а — y) (7.74) 


n? — sin? (y, — y) 


This has been expressed in various equivalent forms in the literature. 
For (v, — y) small and y small, 


у =tcosy Yi — у — ji: oa To yi ? (7.75) 
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Or 
~ 1 

у = 1005 у б У 41 -» 


| (7.76) 
= t cos y (p, — 7) A | | 
The ultimate small angle approximation is 
_ п — 1 
у = 6б =) (=). (7.77) 


Thus, over small angles, prism rotation produces ап approximately linear 
scan. Gorog, et al.^, Lindberg?, and Kulikovskaya, et al.* present these and 
other design equations. Lindberg also presents designs of refractive cube 
scanners used in Swedish military and commercial devices. 

The differential change in displacement with differential rotation is 
found by taking dy/dy. For a paraxial ray, o, = 0 and 


y 7-tsin y | -— SSY _ | | (7.78) 


Jn? — sin? y 


Then 
dy sin? y — cos? у біп? y cos? y 
—— =-t |соѕ у *——————— — ————————|: 7.79 
dy | y (n — sin? y)! (n? — sin? y)?” ( ) 
For tilt angles y << 1/2, 
yet yl — le terms of order 73. (7.80) 


For y= 0, the shift z in focus away from the lens with increasing ray angle Фф, 
is, according to Gorog, et al.? , given by 


п |) _ 005 1 (7.81) 
n? — sin? ¢, 
or for 9, << 1/2, 


а ми (lz) +07 > (1-4). (7.82) 


n 2n? 
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The central ray is shifted by 
- 1 ) 
z=t(1-+ . (7.83) 


They also derived the focal plane shift z for nonzero tilt angle у for a 
paraxial ray (ф, = 0), 


"E 
z-t|esy- 87 __ -4 — nom __| (7.84) 
(n — sin y)? (n? — sin? y»? 
For y << m/2, 
- L) 2t [5 n У 
2 (1-1 кү (2-2 - 3: | (7.85) 


Jenkins and White?, Levit, Smith®, and Waddell’ show that such а 
prism introduces transverse and lateral chromatic aberration, coma, spherical 
aberration, and astigmatism. The mechanical analysis used for the rotating 
mirror prism is also applicable for a refractive prism. Lindberg? reported that 
in a system application he removed the edges of a prism to increase mechanical 
strength and to reduce drag. 

Prism scanners have several good features. Their motion is continuous 
and therefore tends to be stable. Used in a convergent beam, the prism can be 
small in size, and the prism edges can be removed to reduce inertia and drag 
for low scan motor power. Mechanical noise can be very low because simple 
uniformly rotating mechanisms are used. Interlace is readily achieved by 
canting prism faces, as shown in Figure 7.26. The disadvantages are that scan 
efficiency is poor, the focal length shifts, and the prism introduces aberrations. 
Another problem is that refractive materials used for such a prism have high 
surface reflection losses and thus must be anti-reflection coated. However, 
these coatings have reduced efficiency for large incidence angles, so near the 
prism edges, reflection losses will be higher. If the prism temperature differs 
from the apparent scene temperature, the prism may also spuriously modu- 
late the scene image because of varying self-emission as a function of scan 
angle. 
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Figure 7.26 Rotating refractive prisms. 
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7.5 Rotating Refractive Wedges 


The rotating refractive wedge is one of the most flexible scan mechanism 
elements available. As discussed by Rosell® and by Wolfe’, the use of two 
rotating wedges allows generation of scan patterns which are unidimensional, 
circular, elliptical, rosette-shaped, or spiral. Wedge scanners must be used in a 
parallel beam because they introduce severe aberrations into a converging 
beam. Therefore we will analyze only the parallel beam case shown in Figure 
7.27. Let the incident ray be represented by an angle y, having two compo- 
nents фух and Ф ү, and let the ray between the wedges be o; with compo- 
nents yy and yoy. The coordinates (x,y) are chosen so that positive angles 
are rotated upward from the axis. The exitant ray is parallel to the axis to 
represent the parallel beam case. The wedge angular rotations around the 
axis are measured as positive when counter-clockwise and are denoted by 
0, and 0,. 
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y A W A W2 
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Figure 7.27 Rotating prism-pair scanner. 


The motion of the ray v, as 0, апа 0, change may be derived by 
following y, backward through the system using successive applications of 
the wedge deviation principle as follows: 


1. Find the deviation of фз as it leaves wedge 2 in the coordinates 
rotated by the wedge rotation 0; ; 

2. Resolve these y, ray components into unrotated coordinates and 
then into the rotated coordinates of wedge 1; 
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3. Find the deviated ray components leaving wedge 1 in rotated 
coordinates; 
4. Resolve these into unrotated coordinates to find the desired y, 
components. 
Figure 7.28 shows the constructions which are necessary. Working back- 
ward through the system, ray y3 enters wedge 2 with components 


in unrotated coordinates. Leaving the wedge the ray angle is the wedge 


deviation angle 6,. Resolved into unrotated coordinates, the components 
are 


Фәх = 5, cos 0 (7.87) 
апа 

Pay = -6; sin 05. (7.88) 
Resolving these components into the rotated coordinates of wedge 1 yields 

эх = 95x cos 0 — py sin 0, (7.89) 
and 

Q2y =Poy соѕ 0, + pzy sin 0. (7.90) 


Leaving the wedge, the ray components in rotated coordinates are 


Yix =Pox + д: (7.91) 
and 

ту = 2yY- (7.92) 

Finally, resolving these into unrotated coordinates gives 

Pix ^ 91x созбу * e y віп; (7.93) 
and 


YLY -91y COS 0, — 91x sin 0,. (7.94) 
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Figure 7.28 Ray deviation constructions. 
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Combining these equations gives the desired general equations for a two-wedge 
scanner: 


1x =[5, cos (0, —05) + 6; ] cos 0, + ô, sin(0, +6.) sin0, (7.95) 
and 
pı Y =ô біп (0, —0,) cos Ө, —[5, cos (0; —02) + ê, ] sin0,. (7.96) 


Table 7.2 gives some of the useful cases of the wedge pair for constant and 
equal wedge velocities. Spiral and rosette patterns can be generated using 
unequal velocities. A disadvantage of wedge scanners for line scanning is that 
the sinusoidal scan leads to a lower scan efficiency than is accomplishable 
with other scanners. Also, frame-to-frame and line-to-line registration require 
tight control of the prism angular velocities. When gear drives have been 
used, gear backlash and wear have been serious problems. 


7.6 Other Scanners 


The revolving lens is a brute force solution to the problem of scan 
generation. Figure 7.29 shows the simplest mechanization, a lens revolving 
back and forth between two stops. A variation of this is the “spin-ball”’ 
scanner depicted in Figure 7.30, which was used in an early FLIR. Both 
systems are impractical for large aperture systems because too much glass 
must be moved, and the spinball has the added disadvantage of poor scan 
efficiency. 

The dominant scanner models employ rectilinear scan generation, but 
another interesting though less frequently used class is that of circular 
scanners. Two approaches have been used: rotating a complete sensor having a 
radial detector array, or rotating the scene using rotating optical elements. 
The former approach is illustrated in Figure 7.31. Such scanners have the 
disadvantages that large sensor masses must be rotated, and that the associated 
gyroscopic effect may complicate gimballing. 

Rotating the scene rather than the sensor is much easier, and one 
successful method which has been used is the rotating V-mirror of Figure 
7.32. Another example of a circular scanner is shown in Figure 7.33. Major 
advantages of such scanners are that their motion is continuous and that any 
image rotation introduced by a pointing gimbal is removable by optical dero- 
tation. Finally, it is also possible to oscillate the complete electronics assem- 
bly as shown in Figure 7.34. 
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Table 7.2 
Scan motions for the four possible dual wedge scanner modes 


Case Conditions Фіх Yiy Motion 
1 0, -0,-0 25 cos 0 —20 sin 0 circular 
бі = p = 6 
2  05,-—0,-—0 26 соѕ 0(1—іп20) 0 unidimensional, 
51 =5, =6 ~ 28 cos 0 for bidirectional, 
small 6 approximately 
cosinusoidal 
3  04,-—0,-—0 28,соѕ20+(5,—5,)соѕ0 (62—51 )ѕіп 0 approximately 
5, #5, elliptical 
4 0, =0, + Р ô [cos0 +с050 cosP —6 [sin(@+P)+sin@] same as case 2 
| | except scan | 
0, =0 +5іп05іп(20+Р)] rotated by P. 
бі = 05 = 6 


DETECTOR ARRAY 


Figure 7.29 Simple nutating lens scanner. 
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Figure 7.31 Rotating sensor scanners. 
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Figure 7.32 V-mirror rotating optics scanner. 
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Figure 7.33 Another circular scanner. 
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Figure 7.34 Nutating focal plane scanner. 


7.7 Shading Effects 


In certain types of suboptimal FLIR designs the displayed scene has 
superimposed on it a broad, slowly-varying spurious signal. This image defect 
is called shading, after a similar but unrelated effect in television camera 
tubes. Shading is produced whenever either of two effects occurs. First, if 
the detector signal from a uniform source in the scene varies as a function 
of the scan angle, the displayed image of the source will appear to vary or to 
shade. Second, if the component of the detector background signal originating 
from sources not in the field of view varies, the ac coupling of the FLIR will 
pass the variation into the video and a scene-independent shading will result. 

There are two types of shading: signal-dependent and signal-independent. 
Signal dependent shading arises from four sources. The first is distortion in 
the optical system which results in differential image expansion or contrac- 
tion as a function of field angle. This has the effect of locally dimming or 
brightening the focal plane signal. Because FLIRs must always be ac coupled, 
the changes in background flux due to distortion are emphasized. The second 
is vignetting in the optical system which causes a rolloff of signal correspond- 
ing to the vignetting pattern. These two sources can often be controlled 
satisfactorily by good optical design. The third source is shading in a uniform 
target because the optical collecting area projected along the instantaneous 
scan angle varies as the cosine of the scan angle measured from the optical 
axis. 
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The fourth source of signal-dependent shading is due to variation in the 
angle at which a converging beam strikes a detector, and can be a much more 
serious problem. To a first order approximation, plane detectors are 
Lambertian receivers; that is, the signal output varies as the cosine of the 
angle between the input beam and the normal to the detector surface. All 
convergent beam scanners and all parallel beam scanners with pupil migration 
produce a change in beam angle, as shown in Figure 7.35. These variations in 
converging beam angle cause cosine dependent signal shading. In some detec- 
tors, increasing reflection losses with angle may also contribute to this effect. 

Signal-independent shading may occur whenever the scanner and optics 
are configured so that housing radiation reaches the detector. Again, conver- 
gent beam scanners and shifting-pupil parallel beam scanners are the offenders. 
A particularly simple case is shown in Figure 7.36, where a converging beam 
scanner has been modelled by an equivalent oscillating lens scanner for 
diagrammatic clarity. For any instantaneous scanner position, the detector 
must receive the radiation from the full solid angle 62, formed by the rays 
converging from the lens. However, to accommodate all scanning positions, 
the fixed detector must accept rays over the solid angle Q, > 9. Thus, 
the detector will receive radiation not only from the scene, but from the 
housing as well. The housing is not in focus, but the power reaching the 
detector from the housing may vary with position, so that spurious stationary 
patterns are introduced into the video. 

In the case of a migrating-pupil scanner, the detector lens must be 
oversized, with the result that different parts of the housing are seen with 
changing scan angle, causing shading if housing temperature differences exist. 
It is important to note that a parallel beam scanner which scans in only one 
direction unavoidably has beam shift in the non-scanned direction. In itself, 
such shading may not be too objectionable if it is limited to a few percent of 
the maximum displayed scene luminance. However, if the FLIR is to be used 
to provide inputs to an automatic target motion tracker or to some automatic 
target classification device, shading must be minimized. When optical design 
alone cannot eliminate shading, electronic compensation and optical baffling 
can be employed. 
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Figure 7.35 Beam shift due to scanning. 
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Figure 7.36 An example of aperture scanning. 
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CHAPTER EIGHT — THERMAL IMAGING SYSTEM TYPES 


8.1 Introduction 


The classified nature of most advanced thermal imaging development 
has prevented much discussion of device characteristics in the open scientific 
literature. The few examples are the thermographs described in references 1 
through 7, the commercial FLIR's in references 8 and 9, and the military 
FLIR's in references 10 through 12. From the viewpoint of complexity, the 
single element system considered briefly in Chapter One is the simplest con- 
ceivable system. However, fundamental limits or the detector state of the art 
may not allow such a system to have a sufficient thermal sensitivity, so that 
the per frame, per resolution element signal to noise ratio must be improved 
by adding detectors. 

Detectors may be added by preserving the single element scan pattern 
and stringing elements together horizontally, by making the scan one dimen- 
sionaland stringing elements vertically, or by a combination of the two. Some 
of the possibilities are shown in Figure 8.1. Each added element requires as a 
minimum its own preamplifier, so the price of sensitivity improvement is 
increased system complexity. The increased complexity may be reduced 
somewhat by the techniques of serial processing or of multiplexing, both of 
which reduce an n-channel system to a single video channel system. 

Conventional thermal imaging system designs differ from each other in 
two fundamental ways: the means by which the scene is dissected and the 
means by which preamplified detector signals are processed for video presen- 
tation. The particular implementations of the optics, scan mechanism, cooler, 
and display are not fundamentally significant if they meet at least minimum 
requirements. For example, whether a cooler uses a Stirling or a Vuilleumier 
cycle may influence cost, weight, or reliability, but it does not categorize a 
system. As we shall see, however, variations in scanning and in video pro- 
cessing may radically alter the functioning of the system. 
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Figure 8.1 Possible uses of discrete detectors. 


8.2 Serial and Parallel Scene Dissection 


In addition to distinctions between scanning mechanism types such as 
convergent and parallel beam scanning, and one- or two-dimensional scanning, 
there is a distinction between serial and parallel scene dissection and detector 
signal processing. The differences between the two scan types are shown in 
Figure 8.2. In parallel scene dissection, an array of detectors is oriented per- 
pendicular to the primary scan axis, as in a unidimensional detector array 
used with an azimuth scanner. All of the detector outputs are amplified, 
processed and displayed simultaneously or in parallel. In serial scene dissec- 
tion!?^, an array of detectors is oriented parallel to the primary scan axis 
and each point of the image is scanned by all detectors. The detector outputs 
are appropriately delayed and summed by an integrating delay line which 
superimposes the outputs, thereby simulating a single scanning detector, or 
they may be read out one-for-one on a similar array of scanning display 
elements. 

One type of scanning may be more advantageous than the other 
depending on such factors as overall allowed sensor size, allowed power 
consumption, and performance-to-cost ratio. The primary advantage of parallel 
processing is that an extrefnely compact special-purpose sensor can be built 
using converging beam scanning with both detector and display scanning off 
of the same mirror. This advantage may be lost when general-purpose parallel 
beam scanning is used. The major disadvantages of parallel processing are that 
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Figure 8.2 (a) Serial and (b) parallel scanning. 


detector D* and responsivity variations in the array cause image non- 
uniformities, and that ac coupling artifacts due to the existence of different 
average scene values on different detectors cause nonuniformities. 

The primary advantages of serial scanning are that nonuniformities are 
eliminated because the scene is dissected in effect by only one detector, and 
that the number of detectors required is dictated by sensitivity requirements 
rather than by the number of scan lines, as in parallel scanning. The primary 
disadvantage of serial scanning is that it must be done in a parallel beam, so 
the optics tend to be more complex. The SNR equation for a serial FLIR is 
not intuitively obvious. The following derivation for the effective D* and 
SNR in a serial scanner demonstrates how serial processing optimizes the 
performance of an array regardless of D* and responsivity uniformities. 

The integrating delay line shown in Figure 8.3 linearly sums the detector 
signals and takes the root sum square value of the detector noises. The 
effective D and the effective signal-to-noise ratio of this simulated single 
element detector are combinations of the characteristics of the individual 
detectors. The performance can be maximized by appropriately selecting 
channel gains going into the delay line as follows. 

Assume that each detector has the same radiant power P incident on it, 
and that there are n detectors, each having responsivity R; and detectivity ДА 
The signal outputs 5, and the noise outputs N; when each detector is pream- 
plied with an adjustable gain G; are 
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5 =P R; С; (8.1) 
and 
yA, AF ҚС; 
N; =R; Gi pF “зу. (8.2) 


1 1 


Each detector is preamplified with an adjustable gain G,, and each out- 
put is delayed by a time interval equivalent to the geometric distance of the 
electrical center of each detector from the final detector in the scan. These 
assumptions are represented by Figure 8.3. The output of the delay line is a 
signal S, and a noise N, which appears to have been produced by a detector 
with detectivity D? and responsivity R,. The outputs are given by 


n 
S =P È К.С. (8.3) 
о . iid 
i-1 
and 
2 1/2 
n ЈЕ 6; 
No=| Z СБ | (8.4) 
o |. D. 
i=l 1 
SCAN DIRECTION 
P P P P 
В. .... .... Rn 
P ~ i i INTEGRATING 
DELAY LINE 
S5 + Ni 
s DELAY 
DIRECTION 
S, +N, Э: 
5; +N В 
S, +N, D So + № 


Figure 8.3 Integrating delay line technique for serial scan. 
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The output signal-to-noise ratio is 


n 
—9 = dl _____ (8.5) 


No п К, С; 2711/2 


Defining a relative gain 


ен (8.6) 


such that 2, = 1, 


Р > Ri 5 
Ll oL. (8.7) 


| D; 
1 1 


It is the quantity S o/Ng which must be maximized by appropriate 
selection of relative gains g.. To maximize S o/No with respect to a particular 
gy, we take the partial derivative 0(S o/ NoJ/ да, and set it equal to zero. Then 


So 
№ 


S 
Ce Е Р Rk Е Р Rg? бұ 1 "i (8 8) 
де, R 3 1/2 р? R; g; 3/2 ' 
У, 
і Di 1 Di 
or 
Ку сұқ 

Ce =k k ZR g. (8.9) 

. 2 . 

і 1 D, 1 


R 2 В. g. V В g 
з + E Ку _ Ак Кра, + 2 IN (8.10) 
D. 2 | 1 1 

iJ D. БЕК 
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= ү (8.11) 


Thus we must solve K equations for К = п unknowns. For the simple case 
where n = 2, the solution is 


К, [DV 
g> =R n» for 2, = 1. (8.12) 
This suggests а general solution 
Ri (Di | (8.13) 
5^ R; \D, | 


Substituting this assumed solution into the set of equations 8.11 yields 
an identity, so the assumed solution is correct. Substituting the solution into 
equation 8.7 for S,/N,, we find that 


| PED? 
о - > Шы (8.14) 


—Q = P 
№, > D 1/2 i 
i 


Thus the maximum theoretical signal-to-noise ratio corresponding to the 
optimum gain settings is the rss value of the individual ratios. If the respon- 
sivities are all equal, this implies that the effective specific detectivity D? is 


ж _ ж211/2 
ро=[> 07] > (8.15) 
and by the mean value theorem 


ж ж 
D* = уп О, ос (8.16) 
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where 
72 
D* = > Di 


rms | n 


(8.17) 


8.3 Performance Comparison of Parallel and Serial Scan FLIR's 


The author is a firm believer in the superiority of serial scan techniques 
for most FLIR applications. This section is an undisguised apologia for serial 
scanning, and should be recognized primarily as an expression of opinion. 
The assertions made here are based on present and foreseeable technology, 
but could easily be disproved by radical improvements in detector fabrication 
technology. 

Two assertions are made regarding the superiority of serial scan: 


1. A given level of objective performance can be achieved with a 
serial scan system using, at most, one-tenth as many detectors as 
a parallel scan system. 


2. If a serial scan and a parallel scan system both exhibit the same 
objective performance, the serial scan system will induce the 
superior subjective impression of image quality. 


These two assertions are made primarily on the basis of practical FLIR 
experiences with competing serial and parallel scan FLIR's having similar 
objective performance. The assertions are supported by two technical argu- 
ments. The first argument proceeds as follows. Postulate two FLIR's, one 
serial and one parallel, which are identical in the following respects: 


clear aperture and optical transmission; 

field of view; 

detector angular subtense; 

frame rate; 

number of active scan lines per picture height; 

spectral region of operation; 

dependence of D* and responsivity on wavelength and frequency. 


IAM RB WN DS 


From the discussions of Chapter Four, we recall that_the effective thermal 
sensitivity of a device improves as the ratio NETD|/F ‹ decreases. An appro- 
priate NETD equation to use for this comparison is 


TNabA fg «үп 


NETD - : (8.18) 
aB Ag то D¥(A AW 


p) AT 


where n is the number of integrated detectors. 
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If we compare the two systems on the basis of NETD//F, and for the 
above seven conditions of equivalence, then the comparison reduces to a con- 
sideration of a quality factor Q defined by 


1 


The reference noise equivalent bandwidth Afp depends оп the dwelltime 
and the combined detector/preamplifier noise characteristics. The D*(A, ) 
depends on the detector bias which can be used, whether or not the detectors 
are BLIP and the cold shielding efficiency, and the economical manufac- 
turing yield. 

At the time of this writing, useful detectors still have significant 1/f 
noise within the bandpass of parallel scanners, but insignificant 1/f noise 
within typical serial scan bandpasses. Yields of BLIP-limited detectors in 
large (200 element) arrays are poorer than in small (20 element) arrays. 
Practical cold shields tend to take simple shapes such as slots rather than 
arrays, so that parallel scan cold shields are longer and therefore less efficient 
than serial scan cold shields. To preserve system compatibility with low-power 
cryogenic coolers, parallel scan arrays of 200 elements must be operated at 
low bias to keep heat loads low, so flexibility in bias to achieve optimum 
D*(Xp) is lost. These four considerations at the present time allow the Q 
for a serial scanner with n = 20 to equal the О for a parallel scanner with 
200 parallel-scanned elements. 

The second argument regarding the subjective superiority of serial scan 
image quality also is primarily due to the difficulty of producing parallel scan 
arrays with uniform detectivities and responsivities. If a parallel scan system 
does not have some form of automatic responsivity control (ARC), respon- 
sivity nonuniformities will cause large spurious signal differences from line 
to line because of uneven responses to the high thermal background. On the 
other hand, if an ARC is used, it will exaggerate the line-to-line detector 
noise differences. Serial scanners avoid both problems by smoothing respon- 
sivity and detectivity nonuniformities in such a way that the maximum 
possible SNR is achieved. 


8.4 Signal Processing 


8.4.1 Low Frequency Response and DC Restoration 


Thermal imaging systems couple the detector signal to the amplifying 
electronics using a dc-blocking (or ac-coupling) circuit for three reasons. 
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First, good contrast rendition requires background subtraction, and this func- 
tion is approximated by a dc blocking circuit. Second, any dc biasing potential 
on the detector must be removed before signal processing. And third, the 
interferring effects of detector 1/f noise must be minimized. The simplest 
circuit which will provide these functions is the RC high-pass circuit shown 
in Figure 8.4. 

The sine wave response is trivial to find, but what we are interested in 
is the response of this circuit to any input. The input and output voltages are 


t 


lif, Qo |. 8.20 
e: (t) == | KOdt — + ОК (8.20) 
O 
and 
e;(t) = КОК, (8.21) 


where Q, is the capacitor charge at time equals zero. Laplace transformation 
of equations 8.20 and 8.21 yields 


| Qo 
E, (s) = I(s) + Cs t I(s)R, (8.22) 


e4 (t) i(t) R e(t) 


Figure 8.4 AC-coupling circuit. 
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and 
E, (s) = I(s)R. 
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(8.23) 


Solving equation 8.22 for I(s) and substituting into equation 8.23 gives 


Q 
E,(s) — Cs 
Е, ($) = К 


RCs RQo 
-— —  E,(s) –———— 
RCs + 1 RCs + 1 


S Q l 
= E() С” 
+ — С s t 1I/RC 
RC 


Expanding the first term by partial fractions, 


1 1 Оо 1 
Е, (s) = E, (5) LR «| — — 


Inverse Laplace transformation gives! 


1 Qo 
e; (t) =e, (t) ж - RC et/RC + 50) c eU RC 


Q 1 
=e, (t) — — et/RC _ RC e, (t) ж e ИКС, 


C ЗРІС. 


(8.24) 


(8.25) 


(8.26) 


Equation 8.26 gives the response е, (t) of the circuit to any input е; (t). 
Assume that Q, is zero for the moment, and let e, (t) be ó(t-7), a unit 


area impulse at time t = r. Then 


1 
t) = &(t т) - — e(t )/RC, 
e; (t) = 6(t — т) RC 
which is shown in Figure 8.5. 


1 where the symbol * denotes convolution 


(8.27) 
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Figure 8.5 An impulse and the impulse response of an ac-coupling circuit. 


Integrating e; (t) over all time gives the interesting result 


оо со 


|| „ба = | it — па – — || e(t —T)/RC at 


О O T 


(8.28) 


This same result can be proved for апу е, (t), so this circuit has the property 
that the average value of its signal output is zero, as one might expect since 


the circuit blocks dc. When Qg is not zero, 


оо 


| e, (t) dt = — 


О 


Qo 


RC? 


(8.29) 
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It is instructive to consider the response to step and pulse inputs of 
unit amplitude. For a step input 


e, (t) = U(t — т) e(t /RC (8.30) 


as shown in Figure 8.6. From equation 8.26, the step response as shown in 
Figure 8.6 is 


e, (t) = Ut — т) e(t /RC. (8.31) 


For a pulse with pulse width tp: 


t 


e, (t) = Rect 5) (8.32) 
p 


and 


e; (t) = Rect | = У) e C7 ТУКС -U(t-t -т) (e Ср ту КС 1) 
Р 


e(t-tp-7)/RC_ (8.33) 


This function is shown in Figure 8.7, where the areas above and below the 
time axis are equal. 


ei (t) 
e» (t) 


Figure 8.6 Step function and step response. 
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e, (t) 
ез (t) 


Figure 8.7 Rect function and response. 


Clearly any low frequency elements of the scene will suffer from dc 
droop and cause undershoot whenever this circuit is used. This causes three 
kinds of image defects. The first type occurs with large moderate tempera- 
ture difference objects, and is not very serious. An object such as that shown 
in Figure 8.8 will when imaged by a unidirectional ac-coupled scanner pro- 
duce the droop and undershoot shown. In some cases this behavior may 
mask the presence of other targets. Far more serious is the presence of a 
very hot source, as shown in Figure 8.9. Since the average value of the 
circuit output is zero, the positive signal response to the hot target will be 
accompanied by a negative signal response of lower amplitude but of longer 
duration. This undershoot can easily extend across the display, as shown in 
Figure 8.9. The third kind of image defect arises in a multi-element scanner 
and is related to the charge Qo; on the circuits of each of the j channels, 
given by 


Од = | igit (8.34) 


- оо 


where t = 0 is taken to be the observation time. Thus this charge represents 
the past history of the channel. It is likely that each channel has not had the 
same average signal on it, so the Qoj will in general not be identical. Referring 
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TARGET IMAGE WITH UNIDIRECTIONAL IMAGE WITH BIDIRECTIONAL 
SCAN SCAN 


Figure 8.9 Non-dc restored effects with a small hot source. 


back to equation 8.26, we see that this charge determines the level of e; (t) on 
each channel. Thus two channels viewing the same object against different 
backgrounds may give different outputs. This is clear from consideration of 
the target shown in Figure 8.10 and the channel responses shown in 
Figure 8.11. 


— — CHANNEL 1 


CHANNEL 2 


Figure 8.10 A target which demonstrates the utility of dc-restoration. 
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Figure 8.11 (a) Channel responses ina non-dc-restored system to the target of 
Figure 8.10 assuming RC <<Тҙ and (b) the resulting imagery 
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There is no satisfactory simultaneous solution to all three image defects. 
DC droop can be eliminated by reducing the time constant, but this increases 
the duration of the undershoot. Undershoot can be cured by increasing the 
time constant, but this increases droop. Channel responses can be made more 
uniform by dc coupling, but this complicates detector biasing and reduces 
contrasts for small delta-T targets. Droop and undershoot may be reduced by 
greatly increasing the signal frequency passband relative to the low frequency 
cut, as in serial scanning, but this does not eliminate the third defect. 

A reasonably satisfactory solution is serial scanning combined with the 
the technique of artificial dc restoration. One version of dc restoration is 
shown in Figure 8.12, where the detector scan includes a view of a thermal 
reference source during an inactive portion of the scan cycle. This source 
might be a passive source such as an optical stop, or an active source such as 
a heated strip. When the detector sees this source, the detector signal coupling 
capacitor output is shorted to ground through a resistor, allowing the capaci- 
tor to charge to a dc value characteristic of the detector signal due to the 
source. When the detector reaches the active portion of the scan, the circuit 
resumes normal operation and only the signal variations around the reference 
capacitor voltage are passed. Restored and unrestored video are compared in 
Figure 8.13. 


INACTIVE PORTION 
OF SCAN 


ACTIVE PORTION THERMAL 
OF SCAN REFERENCE 
SOURCE 


Figure 8.12 DC restoration by thermal referencing. 


6.4.2 Multiplexing 


In systems which use parallel-processed n-element detector arrays, it 
is often desirable to reduce the n video channels to a single video channel. 
This might be necessary in order to reduce electronics power consumption, 
to avoid ganged multichannel video processing, to interface with a simple 
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Figure 8.13 Non-restored and restored signals. 


single element display such as a cathode ray tube, or to transmit video data 
over a single communication channel. This reduction of n channels to one 
channel is called multiplexing and may be accomplished either electronically 
or electro-optically. 

Consider the simple case of a linear contiguous detector array scanning 
in azimuth as shown schematically in Figure 8.14. In electronic multiplexing, 
the preamplified detector outputs are fed to an electronic switch which 
samples each element in sequence and repeats the process periodically. This 
switch is called a time division multiplexer because it time-shares a single 
output channel among the input channels. The switch then feeds a video 
processor and a display, here shown as a CRT, whose scanning element 
traces in space the path in time taken by the switch, placing the multiplexed 
signals in the appropriate places on the screen. If the switch is operated at 
a sufficiently high rate, each resolvable element of the image will be sampled 
frequently enough so that structure-free imagery will be presented. A typical 
electronic multiplexer is analyzed from a sampling viewpoint in Chapter Nine. 

The fabrication of an electronic switch which is sufficiently noiseless, 
transient-free, and fast for high performance applications has been a difficult 
problem in the past. The complexity of this type of system tends to make it 
more expensive and less reliable than some other types. Karizhenskiy and 
Miroshnikov!* have described the principles of a generalized multiplexed 
and interlaced system. 
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Figure 8.14 Electronic multiplexing. 
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Figure 8.15 Electro-optical multiplexer. 
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The electro-optical multiplexer is a particularly easy way to reduce 
electronic processing complexity. It operates by generating a complete low 
signal level video display, typically by a scanned electro-luminescent diode 
array, which is then viewed by a television camera to produce a single video 
channel as shown in Figure 8.15. There are three possible disadvantages of this 
scheme. First, the camera’s spatial sampling rate (number of vertical lines) 
must considerably exceed the thermal scanner sampling rate to prevent 
aliasing. Second, the synchronization between the thermal and television 
framing action must be perfect so that the bright line which is the locus of 
points at which the thermal and camera scans intersect simultaneously will 
be constant in position on the display. This causes the bright line to be incon- 
spicuous and allows it to be blanked electronically if desired. Third, the 
vidicon usually does not completely erase the image on its target surface at 
the end of a field, so that moving hot sources will tend to produce ghost 
images. 

Both multiplexing techniques have offsetting advantages and disadvan- 
tages. The electronic multiplexer usually has no MTF loss, while the electro- 
optical cannot avoid TV camera MTF losses. The electronic may introduce 
noise and transients, whereas the electro-optical usually does not. Both exhibit 
sampling defects, the electronic by introducing a second raster perpendicular 
to the first, and the electro-otpical by the overlaid TV and thermal rasters. 


8.5 Scanning FLIR System Types 


Scanning FLIR systems are characterized by their scan patterns and by 
their video processing of preamplified detector signals. Video processing 
typically takes one of three forms: multichannel serial or parallel processing, 
standard television format processing, or pseudo-TV processing. In multi- 
channel processing there is a one-to-one throughput of detector elements to 
display elements. In standard video the final electronic output of the system 
is a single video signal whose characteristics conform to one of the commer- 
cial standards and which is suitable for use with standard composite video 
equipment. In pseudo-standard video, a TV-like video format and monitor 
are used. 

Recent systems have tended to fall into one of the four categories which 
represent the most reasonable combinations of scan and of signal processing 
types, which are: 


1. Parallel scan — parallel video; 
2. Parallel scan — standard video; 
3. Serial scan — parallel video; 

4. Serial scan — standard video. 
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Pseudo-TV-compatible systems tend not to be as cost effective as other 
types, and will not be considered here. We will consider the typical features 
of each of the other categories and discuss their advantages and disadvantages. 

The simplest conceivable multidetector system is the parallel scan 
parallel video system shown in Figure 8.16. This system requires only a col- 
lecting converging optic, a two-sided scanner, a detector array, amplifying 
electronics, display drivers, and an eyepiece. The system shown is a convergent 
beam scanner which uses one side of an oscillating thin mirror to generate 
the thermal scan and the other side to generate the visible scan. The detector 
signals are amplified and shaped to appropriately drive the visible light 
emitters, and a visible optic magnifies and focuses the emitter scan for the 
Observer. 

Its advantages are that no scan synchronizer is needed, the scanner 
requires little power, and the display is compact. Its disadvantages are that 
only one person may observe the display, loss of any part of a channel causes 
loss of one line of video, all channels must be balanced individually, all 
channels must be controlled simultaneously (ganged), the channels may 
require dc restoration, and video waveform shaping must be performed in 
each channel. In spite of its disadvantages, the parallel-parallel system is a 
very cost effective way to achieve a small viewer. 
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Figure 8.16 Direct view parallel scan — parallel video system. 
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The parallel scan-standard video system with signals converted to stan- 
dard TV video may have a front end identical to that of the parallel-parallel 
system, with video conversion to a standard format being implemented by 
electro-optical multiplexing, as shown in Figure 8.17, and as described in 
in Section 8.4.2. This system views the display of a parallel-parallel system 
with a television camera, thereby producing standard video. Perfect synchro- 
nization of the frame rates of the thermal and of the television scans is 
necessary, and the effective matching of the dynamic ranges of the two 
devices is delicate. 

The serial scan-parallel video system may be implemented as shown in 
Figure 8.18, where a two-dimensional array of detectors is coupled one-for- 
One to a similar array of light emitters. This approach uses the observer’s 
inherent temporal integration to produce the image-smoothing effect asso- 
ciated with serial scanning, and combines this with the inherent simplicity of 
a directly viewed light-emitting diode array. 

The serial scan-standard video system shown in Figure 8.19 uses 
the most direct method for generating standard video. It avoids schemes 
using multiplexers and scan converters and closely models the television 
process. This system combines the advantages of serial scan discussed in 
the previous chapter with the advantage of low cost for displays and for 
accessories such as video recorders, symbol generators, data retransmitters, 
and automatic target trackers. 

The ultimate performance limits of thermal imaging systems are not yet 
clear because the state-of-the-art is more constrictive than theoretical limita- 
tions such as those due to diffraction and to quantum noise. The papers by 
Edgar!? and by Williams! are interesting in this regard. Edgar’s basic con- 
clusion was that the detector state-of-the-art limits may require use of larger 
detectors than desired by resolution criteria, so that a larger than desired 
sensitivity results. Williams' conclusion was that wide field, multiline systems 
may require F/#’s so small or focal lengths so short that they are impractical. 


8.6 Non-Scanning Thermal Imaging Systems 


Conventional mechanically-scanning FLIR is considered by some to be 
an inelegant, inefficient, and unacceptably costly technique for thermal 
imaging. Raster, framing, detector cryogenic cooling, electronic signal pro- 
cessing, and moving optics are the often-cited defects of conventional FLIR's 
which stimulate continuing interest in mechanically and electronically simpler 
devices. Although many devices with more desirable features have been con- 
ceived, none has as yet achieved the thermal sensitivity, the resolution, and 
the response time necessary to displace scanners. 
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Figure 8.18 A serial scan-parallel video system using a dual-axis scanner. 
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Figure 8.19 A serial scan-standard video system. 
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The problem is not that fundamental physical limitations exists which 
prevent non-scanning devices from achieving the necessary performance. It 
is rather that the poor radiation contrasts in the infrared make severe 
uniformity demands and necessitate that a linear energy conversion be 
followed by a background subtraction. Thus all the non-scanning imagers 
such as semiconductor vidicons, image converter tubes, and self-scanned 
charge-coupled imagers which work well in the visible will not function in the 
infrared without improved uniformity and a complicated electronic or electro- 
optical background subtraction scheme. 

Since detection or energy conversion in an incoherent device necessarily 
occurs before background subtraction, the responsivities of the individual 
non-scanning detecting elements must be extremely uniform to prevent 
Spurious image modulation. For example, consider a two dimensional matrix 
of detecting elements which fills an image plane and which linearly converts 
power in the 8 to 14 micrometer waveband to visible light. For a 300°K 
background, the integrated spectral radiant emittance over the band, per 
Chapter Two is 


Wp = 1.72 X 107% watts/cm?, 


the change of W with T is 


5% = 2.62 X 104 watt/cm? *K, 


and the radiation contrast is 


Ср 70.74 %/°K. 


Let us suppose that the individual elements of the matrix are resolvable 
by the eye, so that the spatial frequency composition of the matrix falls 
within the eye's bandpass. Further assume that the element responsivities 
expressed in foot-Lamberts/watt in the array have a fractional rms deviation 
of Op relative to the average element responsivity R. Then before dc- 
subtraction, the image of a uniform target with temperature difference AT 
against a 300°K background Wp will be embedded in a fixed pattern noise 
induced by the responsivity variations and having an rms value of 
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The image signal will have an average value 


5 = ШЕН R. (8.36) 


If detection is now followed by subtraction of a background level 
equivalent to a spectral radiant emittance Wp ' < Wp such that the subtraction 
does not clip the fixed pattern noise, (i.e., Wp — Wp > 30p Wp) then the 
point signal-to-noise ratio in the target image is 


= ү (8.37) 


As an example, consider a background subtraction which removes all 
signals up to a level equivalent to 290° K, ог 


Wg = 1.56 X 10? watt/cm?. 


For a AT of 1°K, the above condition is satisfied for small ор, апа 


2.62 Х 10" 
ор (1.72 Х103) 


SNR = 


=0.0152/op. 


Consequently if such a device is to generate a SNR of 5 which is a typical 
per frame SNR fora scanner viewing a 1°K AT, the rms responsivity variations 
must not exceed 0.3%. 

This analysis has not accounted for the fact that spatial noise ina FLIR 
isnot fixed-pattern in nature on a per frame basis, but random from frame to 
frame so that an SNR improvement by T,F occurs. Since in the hypothetical 
device we are discussing, the responsivity "noise" is fixed in time, the 
uniformity must be improved by a factor of J6 to compare favorably with 
a 30 frames per second FLIR, so that we now require a 0.12% uniformity. 
Thus even if the device is essentially noiseless on a per-element basis, 
responsivity variations introduce a fixed-pattern noise which interferes with 
performance. 

The uniformity requirement derived above is likely to remain beyond 
the state of the art for the near term for various types of two-dimensional 
arrays, so the prospects for staring sensors for terrestrial thermal imaging are 
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not good. By substituting Q’s for W’s in the above derivation, the reader may 
verify for himself that a device which converts photons rather than power has 
even more stringent uniformity requirements. Framing sensors suffer from 
the uniformity problem too, but the use of one-dimensional arrays permits 
better detector uniformity to be achieved. Unidimensionality of responsivity 
variations on the display also permits the observer to “look through" the 
noise easier than if it were two-dimensional. 

If the uniformity problem can be solved, continuously staring systems 
Should have one very powerful advantage over framing sensors. This is that 
the eye integrates signals and rms’s noise at a point over a time period of Т, 
so that a framing system's noise is rms'ed only a few times within То, 
whereas a staring system's noise is rms’ed continuously. Thus a staring system 
allows a reduction of thermal sensitivity requirements from those of a framing 
sensor by about one to two orders of magnitude. 

Perhaps the best way to appreciate this effect is follow Budrikis! " 
by modeling the eye's temporal response in the frequency domain by some 
approximate frequency response such as that of the low pass filter 


Y-[l*(2m T, f]. (8.38) 


Consider the images of a point source in a framing and in a staring 
system, both of which have the same spread function and signal transfer 
function, and have no fixed pattern noise, so that the signal components in 
each image are identical. Then the images produced by the two systems 
differ only in their noise components. The eye will spatially integrate 
signals and noise in the same fashion over both images. If we further assume 
that the displayed spatial noise power spectral densities differ only by an 
amplitude scale factor A, then the spatial integration phenomenon will have 
no effect on our comparison of the two systems. 

Now we consider the apparent or perceived temporal equivalent noise 
bandwidths for each system. Let 22 (f) be the displayed temporal noise power 
spectral density for the framing sensor, and A? g? (f) be that for the staring 
system. Then the perceived noise equivalent bandwidth for the framing sensor 
is 


со 


1 
Аг = > 2 (f) df. 8.39 
Fm J: (f) (8.39) 
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and for the staring sensor it is 


Afs = | A? g DIL e Qm T, 71" df. (8.40) 
О 


If we desire that both systems have the same sensitivity, we may set their 
bandwidths equal and solve for A? , yielding 


| 
[T,F] 


| 2 war 
A2 = О (8.41) 


[ g? (f) [1 + Отт, f)?]-! df 
O 


Since thermal sensitivity scales approximately with the square root of the 
bandwidth for well-behaved noise in otherwise identical systems, we may 
use the quantity A as a noise equivalent comparison parameter, 


А 7 

| emar 

А = (T,EY 12 ———8——————————| . (8.42) 
[е citi «QT, 6] а 


O 


The significance of A is that a staring system's objective thermal sensitivity 
can be worse than that of a similar framing system by the factor A. 

As a simple example, consider noise which is white up to an abrupt 
cutoff f,, with 


1 
f, ҒЫН (8.43) 
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Then 


f 1/2 


А = (Т,Б)? | 


E (8.44) 


-2(2) | 
\F 


For a typical multielement framing sensor, f, is 10^ Hz, and F is 30 Hz, so 
that 


4\1/2 
a =(40—) = 36.4. 


Thus some critical element of the system can have a performance which is 
36 times worse than normally required. For example, the collecting aperture 
could be smaller, the material D* lower, or the electronic processor noise 
higher. It must be emphasized, however, that this advantage is meaningless 
if responsivity uniformity is not achieved. 

The history of attempts to produce a satisfactory non-scanning thermal 
imager date back at least to 1929. Since that time not a single device has 
exhibited the combination of update rate, thermal sensitivity, and spatial 
resolution necessary for useful television-quality terrestrial imaging, although 
at least thirteen different concepts have been implemented. The following 
sections briefly describe the principles of operation of seven classes of non- 
scanning devices. 


8.7 Oil Film Deformation Devices 


The first thermal imaging system used the evaporagraphic principle 
described by Czerny!*, Czerny and Mollet!?, Robinson, et al.??, McDaniel 
and Robinson?!, Ovrebo, et al.?? and Sintsov??. The construction details 
vary, but the basic principle of all evaporagraphs is the same. A thin mem- 
brane is immersed in a supersaturated oil vapor in a chamber in thermal 
equilibrium. At equilibrium the rates of condensation and vaporization of the 
oil on the membrane are equal, but if a thermal radiation pattern is imposed on 
the membrane using an optical system, disequilibrium will result. If the oil 
film and the membrane are sufficiently thin, interference colors will be 
observed when the film is illuminated by white light. These variations due to 
differential evaporation of the volatile liquid correspond to thermal radiation 
absorption by the membrane. 
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In the early 1950’s the evaporagraph principle suggested to Foshee? 4 
and his associates that thin film properties might be used in other ways to 
image thermal radiation patterns. They used a low-viscosity fluid film on the 
back of a thin blackened membrane, as in the evaporagraph, but now the 
film was illuminated and viewed through Schlieren optics. Surface deforma- 
tions resulting from surface tension variations with local film temperature 
were thus made visible. This device did not produce satisfactory imagery 
because of excessive image spread due to thermal conduction in the film. 
The Gretag Panicon?? overcomes this problem by allowing the film to heat 
up only discrete points of two dimensional pattern by using a mesh mask to 
isolate small film areas. 

All of these devices suffer from the fact that they are thermal sen- 
sors; that is, image formation depends on scene-radiation-induced differential 
heating of a sensing surface. There are four serious problems with thermal 
sensing surfaces: 


]. The necessity for good thermal sensitivity makes them susceptible 
to the influence of housing radiation and requires thermal isolation 
of the surface from all sources but the scene. 


2. The necessity to image scene motion requires a short thermal time 
constant, which implies a rapid conduction of heat inconsistent 
with limiting image spread to achieve good resolution. 


3. There is no simple means for background subtraction, so contrasts 
tend to be poor. 


4.  Uniformity of response is difficult to achieve over a relatively large 
sensing surface, so spurious modulations frequently are present. 


8.8 Infrared Semiconductor Vidicons 


The eclipse of mechanical television scanners by television camera tubes 
has inspired many researchers to attempt the same with FLIR, but thus far 
the record has not been good. The most overwhelming problem has been 
that the high background fluxes in the far infrared in terrestrial scenes tend 
to saturate the charge generation surfaces and to deplete electron readout 
beams, so that background subtraction is a necessity. This means that most 
vidicon schemes will work only against reduced backgrounds, such as the cold 
of space. Another serious problem is that vidicons which linearly convert 
either thermal power or quanta to electrical signals require extraordinary 
uniformity of response to avoid spurious modulation, as discussed in 
Section 8.6. Thus far four types of infrared semiconductor vidicons have 
been developed. 

The earliest attempts were reported by Redington and van Heerden? 
in 1959. They investigated the use of camera targets composed of photocon- 
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ductive doped silicon and germanium operated at cryogenic temperatures. 
Gold-doped silicon targets produced the best results, but spectral response 
was limited to about 2 um, and satisfactory sensitivity was not conclusively 
demonstrated. In 1962, Heimann and Kunze?" reported on the Resistron, 
at 2 um-cutoff IR vidicon which used a lead sulfide target. Augmentation 
by a 1.5 kilowatt illuminator was necessary to achieve useful imagery. 
Berth and Brissot?® in 1969 described two 2 um-limited camera tubes. The 
first used a lead-oxysulfide photoconductive target, and the second used a 
mosiac of germanium photodiodes as a target. Both tubes required cryogenic 
cooling, and illumination was required for satisfactory imaging. In 1971, Kim 
and Davern?? demonstrated the feasibility of using one- and two-dimensional 
arrays of indium-arsenide photodiode targets with spectral responses extending 
from 2.5 to 3.4 um. 

Dimmock?? has reviewed much of this work, and has analyzed the 
tradeoffs between some hypothetical infrared camera tubes and conven- 
tional FLIR for varying background fluxes. Dimmock considered both 
mechanically-scanned linear-array vidicons and two-dimensional vidicon tar- 
gets and reached the following theoretical conclusions: 


|. А linear-array vidicon should perform satisfactorily in either the 
3-5 or the 8-14 um band against terrestrial backgrounds. 


2. A mosaic or continuous surface vidicon with responsivity non- 
uniformities of less than 0.04% should perform well in the 3-5 
but not in the 8-14 against terrestrial backgrounds. 


3. In cold space, a two-dimensional vidicon should outperform either 
a linear array vidicon or a conventional scanner. 


Dimmock proposed to subtract background by using an electron flood beam 
to subtract charge uniformly from the target, or by using photoconductive- 
photoemissive targets. 


8.9 Pyroelectric Vidicons 


A pyroelectric material has the property that a change in the material 
temperature produces a change in its electrical polarization. Thompsett?! 
describes the use of this pyroelectric property in tri-glycine sulfate (TGS) to 
create an infrared vidicon. In this scheme a single crystal of TGS is electrically 
polarized by the application of a constant electric field. The front surface 
of the crystal is covered by a thin conductive layer held at a constant voltage. 
Incident thermal radiation produces polarization changes which induce 
localized potential differences between the conducting surface and the rear 
nonconducting surface. The rear surface is scanned by an electron beam 
which deposits charge to neutralize the potential differences. The beam 
current as a function of beam scan position is the analog of the thermal 
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radiation pattern. Since the device is essentially thermal, the temperature 
of the sensing layer must be restored to uniformity prior to readout of the 
next frame if image smear is to be avoided. Thompsett’s pyroelectric vidicon 
therefore uses a mechanical shutter to return the TGS crystal to a neutral 
temperature between frames. Thompsett concluded that a sensing surface 
resolution of 10^ resolvable points per cm? and a thermal sensitivity of 1°C 
are possible at a frame rate of 10 Hertz. 


8.10 Image Converter Tubes 


A few attempts have been made to duplicate the simplicity of operation 
of image intensifiers by developing infrared-sensitive image tubes. Garbuny, 
et al.?? described a tube called a phothermionic image converter. This device 
was thermal and used a scanned light spot to stimulate temperature- 
dependent photoemission from a thin photocathode in the focal plane of 
an infrared lens. It was operated at television frame rates and had good 
resolution, but the minimum detectable temperature difference for large 
targets was only about 10°C. The paper presents an analysis of the limita- 
tions of thermal detectors. 

Auphan, et al.?? discussed a thermal image tube called /e Serval which 
used ultraviolet-stimulated electron emission from a photoconductive cathode 
to produce a visible scene on a fluorescent layer. Ulmer?^ described a 
thermal device in which temperature changes in a thin oil film induced 
reflectivity changes which were made visible by illumination. Choisser and 
Wysoczanski?? developed a thermal image tube called the Bolocon. The 
device operated by UV-stimulated photoelectron emission from a photo- 
cathode deposited on a thin semiconducting glass. Morton and Forgue?® 
described a photoconductive lead sulfide sensing layer operated in a con- 
verter tube. 


8.11 Laser Parametric Image Upconversion 


Upconversion is a technique whereby infrared radiation is mixed witha 
short-wavelength collimated coherent local oscillator beam in an optically 
nonlinear material to produce visible light with the same spatial modulation 
as the infrared light. The electromagnetic theory of upconversion is described 
concisely by Үагіу??, and will not be reproduced here. Laser parametric 
upconversion can occur only in an optical crystal which does not exhibit 
crystal coordinate inversion symmetry so that the electric susceptibility of 
the crystal is nonlinear. In that case an electric field inside the crystal will 
produce an electric polarization which is proportional to the square of the 
field. An incident wave with a frequency отр in the presence of a pump 
frequency Wp will be upconverted to a visible wave with frequency wy 
expressed by 
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Wy = Фур tw (8.45) 


p' 
in such a crystal only if numerous conditions are satisfied. 

The first and most important requirement is that the wave vectors 
must be matched over the crystal interaction length to satisfy conservation of 
momentum, 


Ky = Kip + Kp. (8.46) 


Since the refractive indices for the pump and IR frequencies are necessarily 
different, this can only be satisfied if the crystal is birefringent so that the 
refractive indices for ordinary and extraordinary waves are different. 
Depending on the ratio of the ordinary index to the extraordinary index, this 
allows the pump to be polarized in either the ordinary or extraordinary 
direction, so that it will be phase matched to the component of the infrared 
polarized in the other direction. 

The second requirement is that the crystal must be transparent to all 
three wavelengths involved so that good conversion efficiency is possible. 
The third requirement is that an image amplifier must be available which can 
noiselessly amplify the visible light produced. The visible photon flux cannot 
exceed the incident IR photon flux, and in fact the conversion efficiencies 
are small, so that the visible light produced is faint. The fourth requirement 
is that an optical filter must be available which can filter the high power pump 
beam from the low power visible beam when the two are separated in wave- 
length only by a fraction of a micrometer, | 

Ap 
A, — Ay = —— . (8.47) 
Р У ot MR 


Yariv derives the upconversion power efficiency Py/Pip neglecting 
crystal reflection and absorption losses. It is given by: 


3/2 


P wy’ 02 42 Mg Pb 
qo EIL ( ) — (8.48) 


Р 
IR 2пір Ny ny ео 


where 
Wy = visible frequency 


0 =interaction length 
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n  -refractive index 
Шо = free-space permeability 
e, = free-space permittivity 
РЬ = pump power 
A  -interaction cross-sectional area 
and d = nonlinear optical coefficient defined as given in Yariv. 


Obviously efficiency increases as pump power density and crystal interaction 
length increase. 

Numerous problems arise when the upconversion technique is used to 
produce broad band thermal imagery. The worst is probably that the angular 
magnification of the upconversion process is wavelength dependent and is 
given by 


n n 
б "о 
IR V 
If one selects the material to make this ratio constant for all Хур of interest, 
(i.e., uses dispersion matching) then the image components at all wavelengths 
image in the same place. Otherwise, one wavelength will have one angular 
magnification and another will have a different magnification, which is equiva- 
lent to lateral chromatic aberration. However, if dispersion matching is used, 
a field of view limitation is introduced by the phase-matching condition. 

The material which has been used most successfully for 10 um to 
visible upconversion is Proustite (Ag; AsS; ). The basic elements of an upcon- 
verter system operating in the spatial domain are shown in Figure 8.20. 
Advocates of parametric upconversion as an alternative to conventional 
scanning thermal imaging system propose that image upconversion has the 
following advantages: 


non-sampled (spatially-continuous) imagery; 
elimination of the need for a focal plane cryocooler; 
the potential for noiseless imagery; 

the potential for elimination of IR optics; 
mechanical simplicity. 


л шого 


The primary drawback of the technique, however, seems to be the 
necessity for high pump power. For example, Warner?* concludes that a 
thermal sensitivity of about 1°C can be obtained in a 100 by 100 resolution 
element image upconverter whose pump power times the visible detector 
integration time is 1000 watt-seconds. This clearly is too poor a performance 
for such a large expenditure of power. 
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Milton's?? analyses of image upconversion take into account both prac- 
tical and theoretical limitations, with the conclusion that upconversion is 
unlikely to be competitive with conventional FLIR. The reasons are that: 


l. detective quantum efficiency is too low; 

2. required pump power is too high; 

3. inherent field of view limitations are too restrictive; 
4. a background flux subtraction device is required; 

5. inherent resolution limits exist. 


Milton also concluded that parametric upconversion is most applicable to 
laser infrared radar and to laser pulse-gated active imaging. References 40 
through 42 provide good summaries of this technology. 


8.12 Infrared Quantum Counters 


The infrared quantum counter (IRQC) is an infrared-to-visible conver- 
ter which utilizes radiative energy exchange with electronic energy levels 
to achieve a solid-state imager. The IRQC concept was suggested by 
Bloembergen^? іп 1959, and was analyzed in detail by Esterowitz, еї al.44 
The IRQC is based on the premise that ionic energy levels exist in doped 
rare earth compounds such that transitions corresponding to both visible 
and infrared wavelengths exist. The principle is that an ionic energy level 
transition resulting from the absorption of an infrared photon can be pumped 
to a higher energy level by a laser local oscillator, followed by a spontaneous 
fluorescence of visible light from that level. This simple three level view of 
the IRQC is shown in Figure 8.21. 

The three-level IRQC is impractical to implement for the reasons ex- 
plained by Esterowitz, et al.4*, so the five level IRQC scheme shown in 
Figure 8.22 is used. The material most successfully used to demonstrate 
IRQC action is praseodymium-doped lanthanum trichloride (Pr?* :LaC1 3). 
A sufficient number of transitions exist in this material to allow a broadband 
response in the 3 to 5 micrometer band. 

The quantum efficiency of an IRQC and the output light level are so 
low that an image intensifier or similar device must be used to produce a 
useful image luminance. An effective IRQC would be a complicated device, 
even when compared to a mechanical scanner. To prevent image masking 
by spurious signals from transitions due to the crystal temperature, the 
crystal may have to be cooled. To achieve high IR-to-visible conversion 
efficiency, relatively large pump power must be used, which is inconvenient 
in a spectral region where lasers are few and inefficient. Finally, the IRQC is 
alinearand dc-coupled photon converter, so a background subtraction device 
must be used to improve image contrast. The noise sources of a theoretically- 
perfect IRQC are confined to quantum noise in the IR signal, self-emission 
transition noise, pump noise, and noise in the image intensifier. Limitations 
due to responsive nonuniformities over the crystal face have not yet been 
determined. Experiments with IRQC’s are described in references 45 and 46. 
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Figure 8.21 Simple three-level IROC model. 
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Figure 8.22 Five-level infrared quantum counter. 
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CHAPTER NINE — SAMPLING 


9.1 Introduction 


Sampling effects in imaging systems are equivalent in importance to the 
optical transfer function and the thermal sensitivity in determining image 
quality. Examples of sampling processes are the use of arrays of discrete 
detectors for dissecting images, use of electronic multiplexing or pulse-width 
modulation, use of scan conversion, and relative motion between the scene 
and a framing sensor. Sampling in imagery has been discussed in references 
one through ten. 

As we noted in Chapter Three, two important properties of well-behaved 
imaging processes are spatial invariance of the impulse response and signal 
transfer linearity. When these two properties are not present in FLIR, defec- 
tive imagery may result. As a broad generalization, spatial and temporal 
sampling occurs whenever a system exhibits discrete deviations from invari- 
ance, and amplitude sampling occurs whenever there are discrete deviations 
from linearity. There are two types of sampling of particular importance in 
imaging systems. One is transmission of an object distribution through a 
periodic window function. The other is sampling of the average value of an 
object within each aperture of a periodic aperture array. We will call these 
processes "window" and *'averaging" sampling. 

The structure shown in Figure 9.1 can be used in either type of sampling. 
If the image formed by the system is the product of the object distribution 
with this structure, the array acts as a transmission function and we have 
window sampling. If the image consists of samples (perhaps spatially filtered) 
of the average value of the object within each aperture of the array the pro- 
cess is averaging sampling. Window sampling occurs in electronic multiplexing 
and in motion between an object and a framing sensor. Averaging sampling 
occurs perpendicular to the scan direction in a discrete scene dissection 
system, as in television, laser line scanners, scan converters, and half-tone 
photography. 
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Most sampling problems are not easy to solve using any methods, but 
the solutions are considerably simplified by using the compact Fourier 
analysis notion of Chapter Three, as demonstrated in the following example 
of window sampling. Consider an object denoted by O(x,y) which is alter- 
nately transmitted and obscured by a window sampling function W(x,y). 
The image I(x,y) which results is the product of the object function with 
the transmission function. 

Then 


(х,у) = Оку)  W(x,y) (9.1) 
and the Fourier transform or image spectrum is 


fy fy) = Of fy) + Wy fy). (9.2) 


If as in Figure 9.1, 


Figure 9.1 A periodically transmissive structure. 
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then 


Vf, £j) = Of £j) * | [Sine (a£) - Sinc (afy)] - [Comb (bf,) 


‚ Comb (ы, | | (9.4) 


This image spectrum is shown in Figure 9.2, but no attempt has been 
made to indicate phase shift pictorially. The effect of sampling is the creation 
of sidebands on frequency centers of *m/b and *n/b which contain frequen- 
cies called aliases which are not in the original object spectrum. These aliases 
are merely object spatial frequencies which have been translated in the 
frequency domain by the sampling process. The sidebands are shown 
separated in Figure 9.2, but as b is necessarily greater than a, the sidebands 
overlap. The result is that aliased frequencies interfere with object frequencies, 
and worse, x-frequencies cross-alias, masquerading as y-frequencies, and 
vice versa. 

Averaging sampling is not quite so simple to analyze as window sampling, 
and represents the more interesting sampling problems. Consider a two- 
dimensional object function O(x,y). To demonstrate averaging sampling we 
will find the sampled average value of this function within three important 
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Figure 9.2 Image spectrum through a periodically transmissive structure. 
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aperture functions. First consider the aperture Rect (у/8) shown in Figure 
9.3. Within the Rect function limits we will replace the object value at 
every point (x,y) with the object value averaged over y and evaluated at x. 
Then we will sample this new distribution with a delta function 6(y). Thus 
we have an image 


8/2 
1 
I(x,y) = |i | O(x,y) dy | `6 (у). (9.5) 
-B/2 
We will prove that his expression is equivalent to convolving the object 
with Rect (y/8) and then sampling with 5(y). This is an important point 
because it considerably simplifies analysis of sampled imagery. Rewriting 
equation 9.5 by absorbing the limits of integration into the integrand, we get 


I(x,y) = [+ [ осу · Rect (+) dy | · &(y). (9.6) 


We may change the symbol for the y coordinate to n without affecting the 
value of the integral, giving 


xy) =[F | f O(x,n) · Rect (%) dn | 5(y). (9.7) 


Multiplication of a function by 6(y) has the effect of sifting out the 
value of that function at y = О, so we may rewrite the factor Rect (7/8) as 
Rect ((n—y)/8) without any loss of generality. Then 


I(x,y) = h f O(x,n) Rect (2 т =) an | · 6(y). (9.8) 


As the Rect is symmetrical, 


oo 
юоу-Г- | Оу Rect (2553) an] го) (9.9) 
p 8 
- OQ 
This integral is by definition a convolution integral, so we have 


I(x,y) -т | ооу) Ж Reet(2) | 6(y). (9.10) 
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Figure 9.3 A Rect function aperture. 


This proves that the image value within an averaging aperture is produced by 
a convolution followed by a sampling. The transform is 


Yt, ty) = (об) Sinc (8 t] * 860 


-1 |o ae 
2: (бк, гу) Sinc (В fy) | | (9.11) 

Next consider the infinite array [Rect (у/8) * Comb (y/y)] shown in 
Figure 9.4. If we average the object in y over each of the elementary 
Rect (у/8) apertures and sample with delta functions б(у-пу) with n an 
integer, we get 


co 8/2 + ny 


(xy) = 2, | 


n=- oo 


O(x,y) ау · é(y-n y) | , (9.12) 


ГТ 
Tl 


-p[2 + ny 
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Taking the same steps as in the single aperture case we write 


х,у) = 2, iE || O(x,y) · Rect (=) | m 


E [remm E] m] 


. Reet [ao . Ill 


И 
28 
J 
ГТ 
сој = 
— 
О 
~ 
~ 
З 
N 


= 2 L K ж Rect =») ' б(у—пу) 
n=- оо 8 В 

=L O(x,y) * Rect (5) · Comb (5) (9.13) 
B B y 


Again the effect of averaging and sampling is equivalent to convolution 
with the elementary averaging aperture and sampling by the delta function 
array. The image spectrum is 


(ОК) * Sinc (0 621» [Comb (yf) • 660). 
(9.14) 


1 
8 


Asa final example, consider the sampled average value of O(x,y) within 
each aperture of the array of Figure 9.1, where the array is described by 


[Ree (5) + ве 2] + [em ($) + como (2)] 


(9.15) 
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The image equation 9.16 is 


co со | a/2+nb a/2+mb 
Ky: Y» Y | || | О(х,у) J 


== oo =- oo a 
nr" m -a/2+nb -a/2+mb 


· [8(y—nb) - som] | (9.16) 


It can be shown by extension from the previous two cases that the 
equation can be rewritten as 


(xy) = Б ЕО « | Rect (=) + Rect eJ] 


. (comb " - Comb 5 ) | (9.17) 


This shows that sampling by a two-dimensional array of averaging apertures 
is equivalent to convolution by the basic aperture followed by sampling at 
the lattice points. The transform is 


low 


Tfj) == 104) * Sinc (абу) > Sine (аб)! 


* [Comb (bf ) • Comb (bf,,)]. (9.18) 


The simplifications made in these three examples are what make 
averaging sampling problems manageable. Otherwise the solutions involve 
explicit summations and multiple integrals. It is evident from these examples 
that the effect of averaging sampling by a periodic array of apertures is to 
prefilter the object spectrum before sampling by the delta function lattice 
which is the basis of the array. 

The rest of this chapter is devoted to statements of four practical 
problems and their solutions. In all problems it is assumed that the angular 
magnification of the system is unity, or alternately that image plane spatial 
coordinates are normalized to the corresponding object plane coordinates. 
Scanning systems are assumed to have a linear unidirectional scan, and object 
functions are assumed to include the field of view limits. 
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9.2 Sampling in a Parallel-Processed Scanner 


A common type of thermal imager uses a scanning infrared detector 
array coupled electrically to a scanning light-emitting diode array. As shown 
in Figure 9.5, this system dissects the scene by convolution in the x-direction 
and sampling in the y-direction. Usually the impulse response is invariant in 
the scan direction, and is periodic (non-stationary) in the other direction. 
Figure 9.6 describes this system. The left side of the sketch represents the 
analog channels over which the object is averaged and the right side represents 
the effective sampling locations centered on the raster lines. 

We will determine the image I(x,y) which results from the action of this 
system on an object O(x,y). Taking the process step by step and ignoring any 
optics and electronics spread functions: 


|. О(х,у) is convolved with the detector impulse response rq(x.y) and 
sampled and carried by the electronics channels on optical centers 
y = tny, 


Г(њу) = [O(x,y) «та(ху * Comb >. (9.19) 
2. T is convolved with the video monitor spread function r (x,y). 
I(x,y) = I'(x,y) * r pY). (9.20) 
3. Steps (1) and (2) combine to produce 


(x,y) = 7 « та(х,у)1 * Comb (2)| ктуу). (9.21) 


Denoting the transform of ап impulse response r(x,y) as the optical 
transfer function T(f у» the image spectrum is 


Теби = (ба) ` Talfxfy)] ж Comb(yfy) - О] 
Бобу) | (9.22) 


This image spectrum is shown in Figure 9.7, where the unavoidable sideband 
overlap has been suppressed for clarity. 

The essential features of this result are the creation of sidebands which 
replicate the object spectrum, and the impossibility of perfectly filtering the 
aliased signals from the image without drastically narrowing the image 
spectrum. Note that if sampling were absent, the result would be 
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Figure 9.6 Sampling lattice of the parallel processed system. 
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Tf, fy) = Of fy) auf) (ѓу), (9.23) 


where Tq T, is the assumed system MTF. 

It is important to note that the sampling action may be thought of as 
though the detector scans in the y-direction and is sampled periodically. 
An interesting case of this problem occurs when the detector spread function 
is not the same as the diode spread function, so that the possibility of 
minimizing aliasing effects in the monitor arises. 


9.3 Sampling in a Two-Dimensional Mosaic Staring Sensor 


Image converters for thermal imaging frequently are proposed for thermal 
imaging which consist solely of a two-dimensional mosaic of contiguous 
square elements, as illustrated by Figure 9.8. Such a mosaic averages the 
object over each aperture at the input, samples each aperture once, and 
transmits this value to the output where the image is formed by the con- 
volution of the output aperture with the sampling array. The image equation 
for such a device is 


cx oo Pe) nn] ome) oon 


* [Rect (2) - Rect (+). (9.24) 


{Др 


Figure 9.7 Image spectrum of the parallel processed system. 
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Figure 9.8 A matrix imager. 


Transforming, 
Т, гу) x (бау. fy) * Sinc (of,) + Sinc (ety) | * 
+ [Comb (af,) • Comb СД · {Sine (ot) 


* Sinc (aty)| (9.25) 


This spectrum is shown in Figure 9.9, where sideband overlap has been 
suppressed. The bandlimited object spectrum is replicated in sidebands at 
frequency coordinates of (*n/oa, *m/o), where m and n are integers. These 
sidebands are imperfectly filtered by the reconstruction filter, producing two- 
dimensional aliasing. 

The analysis of Section 9.2 demonstrated that sampling in one direction 
produces aliasing in that direction. The distinguishing feature of the present 
problem is that aliasing occurs in both directions, as well as cross-aliasing. 
That is, x-frequencies masquerade as y-frequencies and vice-versa. These 
effects are evident on close examination of periodic images in color television 
and in half-tone pictures. 
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Figure 9.9 Image spectrum of a continuous mosaic imager. 


9.4 Sampling in a Multiplexed System 


One class of FLIR converts many channels of video to a single channel 
by time-division multiplexing as described in Section 8.4.2. Such a system 
consists of a parallel scanner followed by an electronic switch which sequen- 
tially samples the N identical bandlimited analog channels, and a display 
which demodulates and filters the samples. The description of a multiplexed 
system is identical to that of the parallel processed system prior to step two. 
The spatial equivalent of the time sampling pattern of the multiplexer is 
shown in Figure 9.10. 

The horizontal raster on the left of the figure represents the analog 
channels which are sampled in the x-direction by the multiplexer. The slanted 
raster on the right represents the path of the small rectangular “window” 
function which successively samples each analog channel. This electronic 
window samples each analog channel at intervals a/s for duration a/Ns, 
where s is the number of samples per dwell time (assuming unity multi- 
plexer efficiency). We will approximate the angle the sampling raster makes 
with the horizontal by 90°, because the exact equations are cumbersome 
and the exact results are not worth the added effort. The approximate 
sampling raster we will use is shown in Figure 9.11. 
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Figure 9.11 Approximate sampling raster for the electronic multiplexer. 
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Let the impulse response of the system prior to the multiplexer be 
solely due to the detector response rg(x,y), and let the impulse response of 
the demodulator and ‘final filter (the monitor) be г. The multiplexer 
window function is 


W(x) = Rect (<) « Comb (25). (9.26) 


The steps іп the image construction аге: 


1. Ап image I'(x,y) is formed by the usual one-dimensional raster 
sampling, 


I'(x,y) = [О(х,у) * гд(х,у)] * Comb (7) (9.27) 


2. T is sampled by the window function W and convolved with the 
multiplexer channel impulse response, which we will assume is 
negligible, producing 


l'(x,y) = 1(х,у) • М(х,у) . (9.28) 


3. I” is convolved with the combined demodulator and monitor 
impulse response r,, to produce a final image, 


I(x,y) = х,у) * rq Goy). (9.29) 
Combining these, 


Ixy) = |ОСњу) = rqG)] * Comb (5) у) sry). (9.30) 


Transforming, 
I(fysfy) = (ба) af. f] » (Comb (вгу) ке.) * Welty) 


a Феб). (9.31) 
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The window function transform is 

Тү, — с: 2 . x . 

W(f, fy) = Sinc (55 fx) С fx} 208) | (9.32) 
In the neighborhood of f, = 0 and for N large, 

W ~ a . 

Моћ) = Comb (5 fx) 8(fy). (9.33) 


The image transform becomes 
Те, у) = | O(f x fy) . табу) ж (Comb (8 fy) . 5(f,)] 


s [Comt (2 fx) . à) | af) (9.34) 


or 
ТЕ, fy) = | [бебу · та Л, * [соть (гу) ' comb С 5) 
"s (9.35) 


This image spectrum is shown in Figure 9.12. The essential element of 
this analysis is again the existence of sidebands which replicate the pre-sampled 
spectrum. The analysis of these first three problems may be extended to the 
case of a scan converter viewing another framing sensor. 

As a challenging and practical exercise in the use of this analysis, the 
interested reader could describe the operation of a pulsewidth modulator. 
This device samples each analog channel periodically for a duration briefer 
than a dwell-time, and converts the sampled values to constant amplitude 
pulses with the same period as the sampler and with pulse widths propor- 
tional to the sampled values. The pulse width modulator then drives a 
scanning diode display. 


9.5 Practical Consequences of Sampling 


The most deleterious sampling effects commonly found in thermal 
imagers are time sampling of image motion due to the framing action and 
spatial sampling due to the raster scanning process. Motion sampling causes 
severe corruption of the shapes of small targets moving rapidly perpendicular 
to the scan direction, but spatial sampling produces by far the more objection- 
able results. Biberman!! presents an extensive discussion of raster sampling 
effects, and cites a study by Thompson!? , who determined the distribution 
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Figure 9.12 Image spectrum of a multiplexed system. 


of preferred viewing distances for rasterless and rastered television imagery. 
Thompson found (see also Section 4.8) that the screen angle subtended at the 
mean preferred viewing distance with raster was about 8^, and that without 
raster, observers tend to halve their viewing distance to get a screen angle of 
about 16?. With about 480 active lines on standard television the 8? field 
gives about 1 arcminute per raster line, which is consistent with the maxi- 
mum raster line subtense found to be tolerable by many other sources. 

The significance of this is that when viewing raster-degraded imagery, 
viewers tend to increase the post-sampling spatial filtering effect of the visual 
system in order to attenuate the spurious high frequencies introduced by the 
raster. Consequently, viewers reduce the visibility of desired signal frequen- 
cies as well as that of undesirable ones, and the overall performance is 
degraded from what is achievable using a rasterless system. 

Another consequence of raster sampling is loss of limiting resolution in 
the direction of the raster. This is expressed by the Kell factor K, which for 
standard television relates the vertical number of resolvable lines Ry per 
picture height to the number of active scan lines Ма by Ry = К Ng. 
According to Luxenburg and Kuehn! 3, experimental values of K have a range 
of 0.53 < K < 0.85, with a value of 0.7 being assumed in American tele- 
vision practice. 
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Another problem is that raster tends to catch the eye in free search, and 
to slow it down by distracting it from targets. The net result of sampling is that 
objects embedded in the fixed image structure of sampled system are harder 
to detect, recognize, and identify than in a nonsampled system with the same 
nominal resolution and sensitivity. This is difficult to believe for many of us 
because we аге so conditioned to interpreting a television set as "good" when 
we see a Clear, sharp raster. 

A common raster defect in multi-element systems is an inoperative 
channel due to a detector, an electronics channel, or a display failure. Such 
a failure is usually dealt with either by leaving the affected raster line blank, or 
if possible by driving that line with the signal from either adjacent line. When 
several lines are missing, these solutions are not satisfactory, as they result in 
image features which continually distract the attention of the visual process 
away from the defect-free parts of the image. It has often been demonstrated 
that generating an artificial signal consisting of the average of the two channels 
adjacent to the dead channel produces imagery so good that it is difficult to 
find the defects, so long as the percentage of dead channels is low. 

From the definition of OTF in Chapter Four, and from the present 
discussions, it is clear that the OTF concept applies only for image directions 
for which there is no sampling. Consequently some measure of sampled image 
quality such as the relative percentage of image power attributable to aliased 
frequencies must be used to describe sampling defects. Another possible 
measure is the resolving power for bars oriented parallel to the scan direction. 
At the time of this writing, no satisfactory summary measure of sampled 
image quality has been identified. 
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CHAPTER TEN — VISUAL TARGET ACQUISITION 


10.1 Fundamentals of Target Acquisition 


The process of searching the display of an electro-optical imaging sys- 
tem for a target consists of four interrelated processes: detection, classifica- 
tion, recognition, and identification. Detection is the discrimination of an 
object from its background and its assignment to the class of potentially in- 
teresting objects. Classification is the assignment of the detected object toa 
gross class of objects such as vehicles or combatant vessels. Recognition is the 
assignment of the classified object to a specific subclass such as tanks or des- 
troyers. Identification is the assignment of the recognized object to an even 
more specific category such as M-60 tanks or Spruance class destroyers. 

The user of a thermal imager typically expresses the performance of the 
system by a single acquisition probability P[ Acq] which gives the probability 
of accomplishing an assigned search task. We may write P[ Acq] as a product 
of conditional probabilities if we denote the various conditions as follows: 


NOTATION MEANING 
In Target appears in the search field 
Look Target is looked at by observer 
Det Target is detected by observer 
Clas Target is classified by observer 
Rec Target is recognized by observer 
Iden Target is identified by observer 


Then 
P[Acq] = P[Iden/Rec, Clas, Det, Look, In] * P[Rec/Clas, Det, Look, In] 
• P[Clas/Det, Look, In] • P[Det/Look, In] 


• P[Look/In] • P[In]. (10.1) 
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The probability that the target is in the field is a complicated function 
of cueing, foreknowledge, navigation, and the search mode chosen. Thus one 
usually assumes P[In] = 1. One also usually assumes that each search task is 
independent of the other so that the conditional probabilities simplify to 
products of unconditional probabilities, giving 


P[Acq] = P[Iden] P[Rec] P[Clas] P[Det] P[Look]. (10.2) 


The complexity of the search process may be readily appreciated by 
considering the following lists of pertinent factors culled from the literature. 
There are fourteen significant displayed target characteristics: 

signal-to-noise ratio (SNR) 

contrast against the background (C) 
critical angular subtense (6,) 

edge gradients 

contour complexity 

context in the scene 

location on the display 

shape 

orientation 

perspective 

size relative to the display size 
velocity of motion through the scene (V) 
Juminance 

state of operation. 


There are four significant displayed scene characteristics: 


background luminance (Lp) 
clutter or false-target density 

rate of motion through the display 
scene aspect ratio. 


There are six significant system characteristics: 


optical transfer function and related resolution measures 

sampling rate in time (F) 

sampling rate in space (scan lines L per critical target angular 
subtense) 

video monitor screen angular subtense (A’) 

grey scale rendition 

dynamic range. 


There are eleven significant observer states: 


training 
motivation 
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fatigue 

prebriefing 

age 

IQ 

personality 

workload 

search technique 

number of observers and the degree of communication between 
them 

peripheral visual acuity. 


There are at least five other tactical and miscellaneous factors: 


required search area 
cockpit illumination 
vehicle noise and vibration 
image or observer vibration 
allowed search time. 


These factors total forty, and there may be many others. All of these 
factors have been investigated to some extent and subsequent sections sum- 
marize some of the more useful experimental findings. Every visual search 
process paper in the literature obtains an experiment of manageable complex- 
ity by isolating a few factors from the others. Yet though there are scores of 
papers in this field, no two results of similar experiments are ever quite the 
same. The best one can hope to obtain from this tangle of data are rules of 
thumb which guide one in the proper direction. 

Most of the experiments we will consider here select an image quality 
variable V, fix all of the other possible image quality variables, and vary V to 
determine its effect on the probability of success of a particular task T asa 
function of the variable, Py[T]. These experiments generally assume that the 
effects of such parameters as noise, resolution, sampling, and target type are 
approximately separable. In that case, the maximum value of a particular 
РУТТІ may not be unity because the maximum achievable probability may 
be limited by the fixed variables. 

It is the author’s opinion that field performance predictions are use- 
ful for understanding in a general way how variables affect РТ] , but that 
attempting to predict field test results accurately is a waste of time. The best 
way to approach the problem of performance prediction is to conduct intel- 
ligently designed field tests and simulations, and to extrapolate to unmeas- 
ured conditions using general principles. 

It is in that spirit that the following sections are offered. They outline 
some reliable experimental results which illustrate general target search prin- 
ciples. There are few published studies concerning thermal systems, but there 
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are many published photographic and television experiments using imagery 
sufficiently similar to thermal imagery to be of interest. We will examine the 
results of these visible-spectrum studies before considering the existing ther- 
mal imagery studies. The best single source of information on visual target 
acquisition is the text by Biberman!. 


10.2 Eye Motion During Visual Search 


The process of searching a visual field for a target is characterized to a 
great extent by the limitations on the rapidity and frequency of eye move- 
ments. In free search, the center of the visual field stops for a brief period 
called a saccade, rapidly moves to another position, fixates again, and con- 
tinues the process until detection is achieved. The rate at which a visual field 
can be searched is severely limited by the number of fixations possible per 
second and by the search pattern, which is usually selected sub-consciously. 
The following discussions explore these problems, considering first the mech- 
anism of search, and then the equations describing it. The key papers describ- 
ing the mechanism of search are those by Enoch? ; Ford, et al.’ ; and Baker, 
et al.^. 

Enoch? used circular aerial maps with angular subtenses of 3°, 6°, 9°, 
18°, and 24? at a viewing distance of 21.6 inches, and of 51° 18' at 13.25 
inches. The maximum scene luminance was 60 fL. While observers searched 
these displays for a Landolt C, Enoch tracked their eye movements and found 
the following: 


1. most eye fixations tend to fall in the center of the display 


2. more are directed to the right half and to the bottom half of the 
display than to the rest of the display 


3. fixation time decreases and the interfixation distance increases as 
display angle increases, as shown in the following table. 


Average Average 
Display Fixation Interfixation 

Angle Time Distance 
3 0.578 0.87° 
6° 0.468 1.82° 
9° 0.384 2.13° 
18° 0.361 3.72° 
24° 0.355 4.33° 


51° 18' 0.307 6.30° 
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4. As the display angle falls below 9°, the search efficiency drops 
drastically because the number of fixations off of the display in- 
creases, as shown below: 


Percent of Fixations Falling 


Display Angle Off of the Display 
>9° 10 
6° 50 
3° 75 


5. Displays larger than 9° induce an increase in central fixations at 
the expense of peripheral fixations, with a consequent reduction in 
search efficiency. 


6. The upper left hand display corner is the least often inspected area. 


Enoch concluded that the optimum display angle is 9° due to the effi- 
ciency losses of observations (4) and (5). The moral of this investigation is 
that a display subtense greater than 9^ is justifiable only if the excess is used 
primarily for orientation and navigation. 

Ford, et al.?, conducted a visual search experiment to determine the 
rate of eye fixations, the fixation duration, and the interfixation distance. 
Searching a 30° field twelve inches in diameter at 2.5 fL in 5 seconds for a 
near-threshold 1/8 inch dot, they found a range of 2.2 to 4.4 fixations per 
second, with a mean of 3.1 per second. The mean fixation duration was 
0.28 second so that fixation occupied 85% of the time. The mean time in 
motion was 0.04 second giving 15% of the time in motion. The mean inter- 
fixation distance was 8.6°. They also found that the central and outer por- 
tions of the field tended to be neglected in search. A summary of eye 
movements during search is given by White?, who asserts that fixation time 
increases as the performance requirements or the scene complexities increase. 

Baker, et al^ conducted experiments with static displays containing 
arbitrary forms. The clutter density, or the relative number of irrelevant 
forms, was constant so that the number of irrelevant forms was proportional 
to the display area. They found that the search time behaves as shown in 
Figure 10.1. Conversely, when the number of competing objects is independ- 
ent of display size, the search time is unchanged as the display size is in- 
creased. They also divided a circular static display into five contiguous 
annular regions of equal width as shown in Figure 10.2, and found that the 
targets were found the fastest in region 3, as shown in Figure 10.3. 
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Figure 10.1 Search time as a function of the area of the display 
to be searched (adapted from reference 4). 
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Figure 10.2 Classification of target locations on the display used 
in the study of reference 4. 
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Figure 10.3 Search time as a function of target location, adapted from reference 4. 


10.3 Free Search Theories 


We will now apply the concept of discrete glimpses or fixations to the 
problem of calculating the probability of looking in the correct direction on 
a display for a single target. 

The most efficient possible search procedure would be structured, but 
not necessarily ordered, so that successive glimpses at the display were non- 
overlapping and completely covered the display periodically without repeti- 
tion. The search equation for that case is derived as follows. Let the solid 
angle subtended by a single glimpse be Оо and the total search solid angle 
subtended by the display be ©. Then іп one complete structured search of a 
field, there will be о/о fixations. The total possible number of glimpses 
in which the target is looked at is 1, and the possible number of glimpses in 
which the target is missed is (9/9) - 1. 

The problem as stated is thus equivalent to finding the probability of 
selecting a black bean in n removals without replacement of beans from a 
jar containing “а” black beans and b white beans. The possible number of 
unfavorable events is b. For this binomial distribution problem, the prob- 
ability of n - r favorable events occurring in n trials or selections is: 


P(n-r) = (т) pot аг. (10.3) 
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The probability of a favorable event in one trial is p, and q is the prob- 
ability of an unfavorable event in one trial. These are related to a and b by: 


2-3. 
and 
-b 
а= = | (10.5) 


In this simple search model, the total number of trials п is the ratio of the 
total allowed search time t to the glimpse time tg, 


- 
|| 


(10.6) 
5 


For the case that the target is detected when it is looked at, the desired 
number of favorable events n-r is 1, and 


Gs 
a-landb-g*-l. 
5 
Тћеп, 
Qe 
ро. (10.7) 
5 
апа 
o 
а=1— <>, (10.8) 
S 
and 
t/t, А 1 А tity — 1 
_| t 2, 5 
Р) 0-10 Noe] Vinx (10.9) 
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Since there is only one possible favorable outcome, instead of using the 
above equation we may evaluate P(1) more simply by: 


P(1) = 1 — P(0) 
t/t 
[ty o, V Q t/ty 
= 1 __ __95 1 2225 
a) Qs 
t/ty 
о \ ttg 
_ g 
=] — | E (10.10) 


The defects of this simple model are that free search is neither struc- 
tured nor nonrepetitive so that the search efficiency is sub-optimal, and that 
detection does not necessarily occur when the target is looked at. Better 
models have been discovered or derived in the three primary papers on the 
theory of search by Krendel and Wodinsky?, Williams and Borow’, апа 
Williams? . 

Krendel and Wodinsky? postulated a random search model and validated 
it by three experiments. They assumed that search is random and that if the 
probability of detection in a single glimpse is Poo the probability of detection 
after n glimpses is: 


P=1—(1—P,,)" (10.11) 


P= 1 — exp [n #п(1—Р (10.12) 


for n large and P,, small. 


Letting the glimpse time be ty and the total search time be t, then 
n= t/t, and: 


P(t) = 1 — exp[t &n(1—P.,)/ty] = 1 —exp(-mt). (10.13) 


Their first experiment used a 72 x 63-inch screen of luminance 5 fL 
viewed at 10 feet, giving a display subtense of 34.92 by 30.44 degrees. The 
target was a near-threshold 3.87 milliradian circle which was located in one 
of 48 possible target locations. This experiment strongly supported the 
simple search model. 
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The second experiment used a 1.2 mrad? target against a large 0.01 fL 
background. The search areas were 0.22 or 0.098 sr. Dark-adapted observers 
searched for 44 randomly located targets, and the results also substantiated 
the model. 

The third experiment was a thirty second forced choice search by dark- 
adapted observers of fields 6.8°, 18°, 32°, and 43° in diameter and at back- 
ground luminances of 12.4, 1.03, 0.1, and 0.01 fL. The targets were circles 
of angular diameters 1.41, 3.81, 7.03, and 13.48 milliradians, and all target 
contrasts were at least twice the 95% threshold of detection for long viewing 
time. They concluded that the exponential search model is satisfactory under 
three conditions: 


1. the search time does not exceed 30 seconds 


2. the image parameters such as contrast or size are constant during 
search 


3. the observer does not consciously use a nonrandom search pattern. 


At the time of their investigations the pattern of eye fixations was not 
known, so their model could not account for this. Their experiments also 
indicated that training (practice at the task) had no statistically significant 
effect on the test results. 

Williams and Borow’ verified for different clutter densities and. different 
rates of scene motion through the display that the recognition probability 
takes the simple form P= 1—exp(—mt).Williams® developed a somewhat dif- 
ferent search theory based on the fact that successive fixations within an 
interval of a few seconds tend to be clustered, resulting in a semistructured 
search process. Defining N as the number of complete scans of the field com- 
pleted during a given search experience and F as the fraction of the last scan 
completed before the desired detection occurs, Williams proposed that the 
probability of detection in (N+F) scans is: 


P(N+F) = 1 — (1—P,)N + ЕР, (1-Р), (10.14) 


where Ре, is the probability of detection in a single complete scan of the 
field. 


When the total search time t is an integral multiple of the single scan 
search time t.., Williams has: 


SS? 
P(t) = 1 — exp(—mt) 


where 


m = — fn(1—P,./t... (10.15) 
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In all of the above equations, the time t allowed for search can be related to 
physical search parameters such as sensor slewing rate or aircraft ground 
speed, sensor depression angle, sensor field-of-view, and sensor altitude as 
demonstrated in section 10.4. 

Bloomfield? showed that search in a densely cluttered scene is facili- 
tated when target-to-clutter contrast is increased, or when target-to-clutter 
size is increased. Petersen and Dugas!’ similarly found that search times are 
reduced when target contrast or rate of motion relative to the background 
are increased. In the case of target motion, they found that the free search 
equation must be modified by the addition of a factor to the exponential. 
For target velocity V through the scene of less than 5 degrees/second, this 
factor is C(1 + 0.45 V?) where 0.3 € C € 0.5. 


10.4 Search Geometry 


Parametric tradeoff analyses cannot be performed without reference to 
some assumed search pattern or procedure. A simple example is that of 
terrain search from a moving aerial platform. To simplify the equations, we 
will assume that the sensor altitude is constant and that motion relative to 
the ground is in a straight line. 

Let A be the sensor azimuth field of view, B be the sensor elevation 
field of view, 04 be the sensor declination angle measured from the horizon- 
tal to the center of the sensor field of view, H be the sensor altitude and У, 
be the sensor ground speed. These quantities are shown in Figure 10.5. 
together with the ground dimensions intercepted. 

The quantities are related by: 


H/R, = sin (04 — B/2) (10.16) 
Н/Е, = sin (04 + B/2) (10.17) 
H/D, = cot [90° — (да + B/2)] (10.18) 
H/D, = cot [90° — (64 — B/2)] (10.19) 
W,/R, = 2 tan (A/2) (10.20) 


W,/R; = 2 tan (A/2) (10.21) 
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The ground area within the field is 
G = (D; —D,)/[(W2 + W,)/2). (10.22) 


If a target enters the field at point 1 shown in Figure 10.5, and exits at point 
2, the time t it is in the field is: 


D; —D, 
су . (10.23) 


10.5 Equivalent Bar Chart Resolvability 


Numerous investigators have demonstrated that the single most im- 
portant factor in determining the level of target discrimination possible with 
a system is the resolvability of various bar chart equivalents of the target. 
This appears to be the case regardless of the nature and combination of the 
image defects involved. Figure 10.6 depicts this equivalent bar chart con- 
cept, where it is assumed that a target is characterizable by a critical target 
dimension which contains the target detail essential to discrimination. This 
dimension typically is the minimum projected dimension of the target, as 
shown in Figure 10.7. The bar chart equivalents of the target are the mem- 
bers of the set of bar charts whose total width is the critical target dimen- 
sion and whose length spans the target in the direction normal to the critical 
dimension. 

Figure 10.8 shows some examples of how various image degradations 
affect performance. The target is the same in all cases and is characterized 
as having an average apparent blackbody temperature difference AT against 
the background. The target has a critical angular subtense 0с so that it sub- 
tends a fraction of the system field of view A given by ӨС/А. In Figure 10.8, 
system 1 is random-noise-limited, system 2 is magnification-limited, system 3 
is MTF-limited, and system 4 is raster-limited. The way the degradations have 
been selected, 4 is better than 3, 3 is better than 2, and 2 is better than 1, as 
evidenced by the target imagery. These degradations allow detection with 
system 1, classification with system 2, recognition with system 3, and identi- 
fication with system 4. 

The theory relating equivalent bar target resolvability to target dis- 
crimination is that the level of discrimination can be predicted by determin- 
ing the maximum resolvable frequency of an equivalent chart having the same 
apparent AT as the target and viewed under the same conditions. This theory 
іп a somewhat different form was first proposed by Johnson! ! , who sought 
a way to relate in-the-field performance of image intensifiers to objective 
laboratory measures. 
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Johnson determined the number of resolvable cycles subtended by the 
critical target dimension for eight military vehicles and a standing man. The 
now well-known Johnson criteria averaged over all target classes are: 


Number of resolvable cycles 
required for 50% probability 


Level of discrimination of correctness 
detection 1.0 € 0.25 
orientation 1.4 € 0.35 
recognition 4.0 + 0.8 
identification 6.4 + 1.5 


These criteria were derived directly from image interpretation data, and 
their validity is in the strictest sense limited to rasterless image-intensifier- 
type imagery. 

Rosell and Willson!? refined the theory by applying the perceived 
signal-to-noise ratio concept described in Chapter Four. They derived expres- 
sions for the resolvability of bar targets embedded in additive gaussian noise 
on a television monitor, and performed tests to verify the theory. They then 
added noise to television images of vehicle targets, conducted tests with 
Observers to determine recognition and identification probabilities as func- 
tions of perceived SNR, and plotted these probabilities as functions of the 
predicted perceived SNR's required to resolve the equivalent bar patterns of 
each target. They assumed that a seven-cycle equivalent bar pattern is re- 
quired for recognition and that an eleven-cycle equivalent bar pattern is re- 
quired for identification. 

The results support the simple theory that target recognition and 
identification are predictable from equivalent bar chart resolvability. The 
complete experiments and results are described in Biberman! . These data are 
strictly applicable only for targets embedded in noisy but otherwise perfect 
525-line television imagery and may or may not be valid for the wide variety 
of FLIR formats which are used. Williams! 3 performed similar experiments 
with computer-processed FLIR imagery and found that recognition accuracy 
correlates well with the number of resolvable cycles on the critical target 
dimension. These results apply regardless of the combination of raster, 
resolution, contrast, and noise used, and are described in detail in section 
10.9. 


10.6 Probability of Detection 


The probability of detection of simple geometrical targets featured 
against uniform backgrounds and embedded in random noise was discussed 
in Chapter Four. The conclusion there is that the visual system operates as if 
it calculates a signal-to-noise ratio and compares it with a threshold value of 
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SNR as a test of significance. There is considerable data to support this 
theory in various regimes of viewing. The theory is supported in the 
quantum-noise-limited or contrast limited mode by the data of Blackwell! *, 
and in the additive noise mode by the data of Coltman and Anderson! $, 
Schade! 5, and Rosell and Willson!’. Virtually no data have been published 
relating detectability of non-simple targets in non-uniform backgrounds to 
signal-to-noise ratio. 

Chapter Four asserted that the probability of detection increases as 
target viewing time, angular subtense, and contrast increase. The tests of 
Bernstein! 7; Coluccio, et а1.! $; Hollanda and Harabedian!? ; and Greening 
and Wyman?? with real targets and situations showed that in practical tasks, 
the percentage of targets detected does indeed increase as target contrast in- 
creases. Bernstein!?, for example, found that CRT images of vehicles and 
personnel required 90% contrast (C 4 (LT—Lp)/Lp) to reach the maximum 
attainable recognition probability. 

Bernstein!’ found that resolution affects detection probability only 
insofar as it influences signal-to-noise ratio or target contrast. However, 
Coluccio, et al.,!5 found that detection completeness improves with limit- 
ing resolution R [resolvable cycles/target dimension] by: 


completeness = K, log K,R, (10.24) 


where K, and К, are empirical constants. 

Bailey?! used the search theory discussed earlier as a point of depart- 
ure, and cited many references which indicate that the cumulative probabil- 
ity of detection P[Det] is a function of the clutter. When a cluttered scene 
of displayed solid angular subtense О must be searched for a target of 
displayed angular subtense S27 in time t, Bailey asserts that the cumulative 
probability of detection is given by: 


cumulative P[Det] = P[Det/Look] [1 — exp (—6.9 t 01/0, КІ. 


(10.25) 


Here P[Det/Look] is the probability that if an observer looks directly at the 
target, he will detect it, and K is an empirical clutter factor proportional to 
the density of false targets in the scene. Typical values for K in the literature 
surveyed by Bailey range from about 0.01 to 0.1. The experiments of Baker, 
et al.^, and of Williams and Borow? support this theory. 
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10.7 Probabilities of Classification and of Recognition 


To the author’s knowledge, no data has been published describing the 
image quality necessary to convert a detection to a classification. This 
neglect is not, however, caused by the unimportance of classification as an 
imagery interpretation task. Classification is particularly important in a mili- 
tary context. For example, it is common that a certain type of target such as 
a vehicle is not supposed to be in a secured area or that a bogey is known to 
be in a certain sector. In such cases, it is necessary only to detect the target 
and to classify it simply as being a vehicle or an aircraft before firing on it. 

The probability of recognition of simple and of practical targets has 
been exhaustively investigated for the effects of variations of a single image 
quality variable. Barnard*? found the probability of recognition of randomly 
oriented Landolt C’s, and of asterisks with one missing spoke, when masked 
by additive Gaussian noise. He found that the results are accurately described 
by assuming the eye-brain system operates as an optimum filter, which sup- 
ports the contention of some visual psychophysicists that the eye-brain 
system consists of an assembly of stochastically independent narrow-band 
tuned filters. 

There is agreement among all investigators that improving the resolution 
of the viewing device improves recognition and identification performance. 
Bailey?! suggests that probability of recognition P [Rec] is related to the 
smallest number of resolution cells (90% detectable spots) contained in the 
narrowest target dimension. Johnson!! reported that for high confidence 
the number of resolvable cycles per critical target dimension should be 4 + .8 
cycles for recognition. 

Bennett et al.,?? found that recognition performance improves at 15 
inches viewing distance as the displayed resolution improves to approxi- 
mately .25 mrad, and that no significant improvement occurs beyond that 
level. Greening and Wyman?? recommend 


r, y 
P[Rec] = exp E) | (10.26) 


where r, = displayed solid angular resolution of the sensor including the eye 


5 


іп mrad’, 


т = empirically required resolution [mrad? | for a particular target, 
m = 1 for r,/r > 1, 


m= 2 for r,/r = 1, 
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От = target solid angular subtense, 


Np = number of resolution elements on target required empirically. 


Their experiments with real scenes indicated that the number of resolu- 
tion elements required for 90% confidence varied from 3 to 20 depending on 
target complexity. Boeing Company investigators? * found that photographic 
information extraction improves approximately linearly with the area under 
the MTF curve and with Ne for the typical types of MTFs they investigated. 

Some of the most consistent results in search theory are obtained in 
investigations of the number of spatial samples or raster lines across a target 
required for recognition and identification. Johnston's?? results for TV on 
the probability of recognition P[Rec] of vehicles as a function of the number 
of raster lines L through the target are fitted well by 


P; [Rec] = 1 - exp |-018 (L+1) (10.27) 


Гог 7< LS 13 lines. 

Numerous field experiments using thermal systems yield a range of 4 + 1 
lines on target for 50% P[Rec] and 6 + 1 lines on target for 90% P[Rec]. 
Scott, et al.,2° generated vertically sampled imagery of model military 
vehicles with 4, 6, 9, 13.5, 20, and 30 scan lines per vehicle height. Obser- 
vers were asked to match the vehicle images with the unsampled images, and 
the probabilities of correct vehicle recognition were computed. Their results 
are summarized by Figure 10.9. Hollanda and Harabedian!? performed 
similar experiments and obtained similar results. 

Gaven, et al.,?" investigated information extraction using photographs 
sampled at equal intervals in both directions with a Gaussian spread function 
sigma of one-half of the lattice spacing. They used values of the number of 
scan lines per vehicle diagonal of 22.1, 33.1, and 49.6. The image intensities 
were quantized into approximately equiprobable levels of 1 to 7 bits (2 to 
256 levels). The probabilities of recognition using the processed images were 
determined as a function of L and the number of bits in intensity, and are 
shown in Figures 10.10 and 10.11. They found no significant improvement 
in performance beyond 3 bits. Johnston?? , on the other hand, using closed 
circuit TV, found no significant dependence of recognition on shades of gray 
on a TV monitor for shades of gray between five and nine. 
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Figure 10.9 Variational bounds on probability of recognition versus lines subtended 
for all classes of vehicles. (Adapted from reference 26) 
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Figure 10.10 Probability of identification versus quantization level and number of 
scan lines per target. (Adapted from reference 27) 
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Figure 10.11 Constant probability of identification versus quantization level and lines 
per target. (Adapted from reference 27) 


Steedman and Baker?* investigated the dependence of target recogni- 
tion on target angular subtense using circular static photographic displays of 
angular diameters 4.6, 9.3, 13.9, and 17.7 degrees when viewed at 24 inches. 
Each display contained 184 arbitrary complex forms selected from a universe 
of 557 forms. Twenty-four forms were designated as targets to be searched 
for, and an illumination of 20 foot-candles was used. All forms had high 
contrast and were only slightly blurred. Their results indicated that the per- 
cent recognition error and the search time are relatively constant for target 
angular subtenses larger than 12 arcminutes, and increase sharply below 12 
arcminutes. Figure 10.18 shows the percentage change in search time and the 
percentage of observations in error as functions of the maximum target 
angular subtense. 


10.8 Probability of Identification 


Hollanda, et al.,?? investigated the dependence of target identification 
on the number of line scans per target and the point signal-to-noise ratio. 
They used twenty model armored vehicles and generated simulated electro- 
optical imagery. The number of scan lines L per vehicle height was varied 
over 16, 32, and 48 scan lines per vehicle, and Gaussian noise was added to 
get image SNRs of 3, 5, 10, 20, and 30. Among other things, they plotted 
percentage correct identifications versus SNR for different L's and plotted 
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Figure 10.12 Percentage change in search time and percent change in identification 
errors versus target angular subtense. (Adapted from reference 28). 


L versus SNR for constant identification performance. Their identification 
probabilities P[Iden] are shown in Figures 10.13 through 10.16 as functions 
of L and SNR. Their basic conclusions were that: 


1. P{Iden] for vehicles increases as a strong function of SNR values 
up to 15. 

2. The P[Iden] for noiseless images were only 5% better than those 
for images with SNRs of 30. 

3.  P[Iden] increased at most 10% when SNR increased from 20 to 
infinity. 

4. The differences in image quality among images with SNRs of 2, 3, 
and 4 were negligible. 


The previously described experiment of Scott et а1.26 with line scan 
imagery produced the dependence of P[Iden] on scan lines per target shown 
in Figure 10.17. They found only slight improvement in percent correct 
identifications beyond L = 20, and the 50% point occurred at approximately 
L = 7. Variations іп P[Iden] with oblique 45° viewing and with nadir viewing 
are shown in Figure 10.18. 
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Figure 10.13 Identification versus lines for point signal to noise ratio variations 
with miscellaneous military vehicles. (Adapted from reference 29). 
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' Figure 10.14 Identification versus lines for point signal to noise ratio 
variations with tanks. (Adapted from reference 29). 
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Figure 10.15 Identification versus point signal to noise ratio for line 
variations for miscellaneous military vehicles. 
(Adapted from reference 29). 
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Figure 10.16 Identification versus point signal to noise ratio for line 
variations for tanks. (Adapted from reference 29). 
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Figure 10.17 Variations in identification for all vehicles versus lines per 
vehicle height. (Adapted from reference 26). 
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Figure 10.18 Identification versus lines per vehicle height for all vehicles. 
(Adapted from reference 26) 
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Hemingway and Erickson's?? data relating probability of identification 
of arbitrary typographic symbols on a television monitor to the symbol 
angular subtense 0. in milliradians and to the number of TV scanlines L 
through the symbol are fitted: well by P[Iden] = 1 — exp (6.4 X10? L 0,)?. 


10.9 Experiments with Computer-Processed FLIR Imagery 


The economic impracticality of constructing FLIR's which exhibit wide 
variations in performance parameters suitable for use in image interpretation 
studies has led to simulation of these effects with processed imagery. One 
such study by Williams! ? determined the effects on recognition accuracy of 
lines per target, magnification, MTF, and SNR. 

Williams and his associates constructed a digital image synthesizer and 
used it with thermographic imagery to produce controlled image quality 
degradations. The purpose of these efforts was to simulate the image charac- 
teristics of conventional raster-scan FLIRs, to use the resultant imagery to 
quantitatively ascertain the effects of parametric changes on observers' recog- 
nition performance, and to identify summary measures of performance. 

The original thermographs were essentially noiseless, had a 2-to-1 aspect 
ratio and 250 scan lines per picture height, and exhibited a gaussian MTF 
with an LSF standard deviation of one-half the scan line spacing. These 
photographs were processed so that the degradations in the originals were 
insignificant compared to the processing degradations. Each photograph was 
reduced to a matrix of 250 by 500 sample points recorded in six bits per 
point or 64 levels. The data were sampled as by a line scan process, spread in 
two dimensions as by a gaussian spread function, degraded by controlled 
noise, and printed out on film by a raster having a gaussian spread. Sixteen 
examples of each of ten target types were processed in this way. The follow- 
ing variations in image parameters were produced: 


Raster lines per picture height: 11, 22, 44, 88 

Raster lines per target height: 3, 6, 12, 24 

Gaussian line spread function sigma 0.38, 0.48, 0.55, 
as a fraction of the line spacing: 0.61, 0.67, 0.72 

Corresponding gaussian MTF's at 0.47, 0.33, 0.23, 
a frequency of one-half of the 0.16, 0.11, 0.08 
line frequency: 

larget image SNRs: 25, 12.2, 6.58, 

3.50, 2.04, 1.35 


lhese processed pictures were then viewed at varying distances to 
produce the following variations in magnification: 
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Picture heights in degrees: 0.5,1, 2,4 
Raster line angular subtense in 

milliradians: 19.8, 9.9, 4.95, 2.48 
Target angular subtenses in 

milliradians: 23.3, 11.7, 5.82, 2.91 


Fifteen thousand trials with eighteen observers were conducted using 
the 160 images degraded as discussed above, and the experiment was con- 
cluded by statistical analyses of the data. 

Williams’s results are startling, for they contradict the conventional 
wisdom in FLIR that small variations in MTF and SNR have significant 
effects on recognition performance. Williams found that the raster structure 
apparently so thoroughly disrupts the image that gross SNR and MTF changes 
have little effect on recognition performance. 

The correlation of image quality variables with recognition was deter- 
mined by a linear regression analysis. The best single measure of recognition 
accuracy for this experiment is the resolving power for tribar targets with the 
long bar dimension parallel to the raster. The next best parameter was the 
number of raster lines on the minimum target dimension. 

The fact that tribar resolving power is a good predictor is consistent 
with the results of Johnson!! for image intensifiers and of Rosell and 
Willson’? for television. This is a useful result because vertical resolving 
power combines sampling rate, vertical spread function, SNR, and magnifica- 
tion in a single observer response measure. Thus resolving power remains a 
good quality measure when sampling is present. 

Figure 10.19 shows the experimental limiting resolutions for bar targets 
with the bar length oriented in the scan direction. The limiting resolutions 
are plotted as functions of the overall picture height and of the number of 
scan lines. This clearly shows the effect of increasing magnification at con- 
stant line rate. The same data are replotted in Figure 10.20 to show the 
effect of increasing line rate at constant magnification. The pertinent results 
of the study are shown in Figures 10.21 through 10.24. 

One strong conclusion of the study is that raster sampling dominates 
performance, so much so that noise and MTF have minor roles in recognition. 
This does not contradict many earlier photointerpretation studies which 
demonstrated a strong dependence of performance on MTF in photographs. 
Rather, this study shows that aliasing in sampled imagery so completely dis- 
rupts the image at low values of lines per target that MTF variations have no 
significant effect. 
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Figure 10.20 Vertical tribar limiting resolution. 
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Figure 10.21 Probability of recognition as a function of resolving power. 
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Figure 10.22 Probability of recognition as a function of lines per target and noise. 
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Figure 10.23 Probability of recognication as a function of lines per target and size. 
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Figure 10.24 Probability of recognition as a function of lines per target and MTF ata 
frequency equal to the reciprocal of the line center to line center spacing. 


Several rules of thumb are evident from'the study results: 


1. 90% recognition accuracy requires on the average 24 raster lines 
per target height, or 12 resolvable cycles per target height. 

2. 50% recognition accuracy requires on the average 15 raster lines 
per target height, or 7.5 resolvable cycles per target. 

3. One raster line need not subtend more than 1 mrad at the eye, but 
cannot subtend less than about 0.5 mrad without causing sub- 
stantial loss in performance. 


These values are consistent with current FLIR practice, viz a 350 line, 
0.25 mrad sensor with a display height of six inches viewed at a maximum 
distance of 30 inches to give 0.57 mrad per line. 

Some interesting examples of computer-processed FLIR imagery are 
shown in Figures 10.25 and 10.26. The imagery was processed! 3 to simulate 
a parallel-scanned system whose MTF is dominated by a detector (read-in) 
aperture and by a light-emitting diode (write-out) aperture. Figure 10.25 
shows imagery processed to 125 scan lines with a constant write-out aperture 
and with varied read-in apertures and point signal-to-noise ratios. Figure 
10.26 shows 31-line imagery with the same relative variations. The original 
thermograph from which these variations were produced is shown in 
Figure 10.27. 
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10.10 Other Observer Limitations 


Bennett, et al.?^? found that one resolution element should not be 
magnified so that it exceeds more than 3 arcminutes (approximately 1 mrad) 
at the eye. This result is substantiated by Hemingway and Erickson?! , and is 
consistent with the investigations discussed in Section 4.7. 

Williams and Borow’ found no degradation in search performance with 
displayed scene angular motion of from 0 to 8 deg/s. At 16 deg/s, search 
time ‘increased approximately 25%, and at 31 deg/s search time increased 
approximately 10046. They found also that horizontal motion produces less 
degradation than vertical motion. 

Williams and Borow’ also found that individual observers exhibited 
variations of four-to-one in information extraction capability. Similarly, 
Thornton, et а1.32 found that different observers exhibited variations of 
about five-to-one in photointerpreter accuracy, completeness, and search 
time. Erickson? found that search time decreases as observer peripheral 
visual acuity increases. Bennett, et al.?? found that they could triple photo- 
interpreter performance simply by informing observers that a particular 
target (one of many present) was in the scene. 

Sziklai?? generated rapidly changing sequences of television pictures 
containing only well-defined objects, and found that about 3 to 5 symbols 
or objects were recognizable per second under ideal conditions. 


10.11 Optimization of System Parameters 


Cost effective system design requires that only as much performance as 
is required be incorporated into the design. Therefore it is necessary to 
understand how various observer limitations interact in a system. As an 
example, consider a simple system having a square field of view and a 
contiguous raster with no overscan. Assume that for search efficiency, for 
display size limitations, for flicker perceptibility, for optimum surround 
factor, or for other reasons, that we have decided on acceptable display 
angles of A' by A' [milliradians]. Assume that from general image quality 
considerations, we have chosen an angular magnification M which couples the 
eye and the system efficiency. Then the sensor field-of-view size is A = A'/M 
[milliradians] square. 

Now assume that a specific recognition probability is required for a 
target with a critical target dimension of CTD in meters located at a slant 
range of R [kilometers]. The critical angular subtense of this target is 9. 
[milliradians] and equals CTD/R. The required recognition probability 
dictates an approximate number of raster lines (or resolution elements in a 
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nonsampled system) subtended by 0-, say L lines. Then one line must sub- 
tend 0./L [milliradians] . This recognition probability also dictates іп a rather 
loose way some minimum critical target angle subtended at the eye, say 0 с. 
which is consistent with the eye's sine wave response and the target spatial 
frequency spectrum. Now 0, and 6’, are related by: 


M = 0',/0., (10.28) 
but we also have 

M = A'/A, (10.29) 
SO 

АЈА = 0'./0. or А = A'0,/0',. (10.30) 


Thus the sensor field-of-view is determined by the choice of an efficient 
display angle for search and by a recognition requirement. For example, 
some reasonable numbers are: 


A' = 164 mrad (approximately 9°) 
Ө. = 1 mrad (3 meters at 3 kilometers) 


6’. = 4 mrad (approximately 12 arcminutes). 


These values give a field of view of A = 164 (1)/4 = 40 mrad = 2.3°, anda 
magnification of approximately 4.1. If we pick P[Rec] = 50%, a reasonable 
value of L is 4 lines. Then one line subtends 6,/L = 0.25 mrad, and the total 
number of channels is set at 40 mrad/0.25 mrad = 160 channels. Finally, for 
an effective observer distance D from the display in meters, and for small 
A’, the effective display width W in millimeters is W = D A'm. A cockpit 
display might have D = 0:6m (approximately two feet), so that for our 
example W would be 0.6 (164) mm = 98 mm = 4 inches. 

Many thermal imaging systems now in use are suboptimal in at least one 
respect because the display size, the line subtense, the field-of-view, and the 
magnification are usually selected independently and for unrelated reasons. 
For example, the field-of-view may be chosen for ground coverage at a 
particular altitude, while the magnification is chosen independently for 
resolution efficiency. Then a large part of the display may be rendered 
unusable because the observer cannot absorb all its contents. 
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The temptation to overspecify or overdesign a system is always strong 
because the state-of-the-art permits a higher field-of-view to resolution ratio 
than a human observer can possibly use efficiently. In the numerical example 
given above, the figure of 160 lines per field-of-view would be low for current 
high performance FLIR’s. The result is that FLIR performance tends to be 
operator-limited rather than equipment limited. 
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CHAPTER ELEVEN — SYSTEM PERFORMANCE TESTING 


11.1 Signal Transfer Function and Dynamic Range 


We have defined a thermal imaging system as a device which converts 
optical radiation in the far infrared to visible radiation in such a way that 
information can be extracted from the resulting image. Typically the power 
gain of this conversion is controlled by the “‘contrast”’ control of the video 
monitor, and an arbitrary dc level is added by the “brightness’’ control. We 
must be able to describe and to measure this power conversion and its depend- 
ence on control settings to assure ourselves that the observer may optimize 
his information extraction capability by adjusting the controls. By this we 
mean that satisfactory surround factors can be achieved for the anticipated 
environments, that noise-limited imagery may be produced, and that high 
contrast imagery may be produced. 

For convenience, the input-to-output power conversion is measured at 
a point in the image of a large square target. The input power is normally 
expressed in units of degrees centigrade of effective blackbody temperature 
difference between the target and its background measured within the ther- 
mal spectral bandpass of the system. The output power is usually represented 
by the photopic luminance of the displayed target image, as measured by a 
photometer*. 

The power conversion is described by what Moulton, et al. call the 
Signal Transfer Function (SITE). The SiTF is defined as the photopic lum- 
inance output of the system display as a function of the target-to-background 
temperature difference in the test pattern of Figure 11.1 for various combina- 
tions of system control settings. 


*The subject of photometry is treated in References 1 through 4. 
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Figure 11.1 Si TF test pattern. 
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Figure 11.2 Idealized Si TF’s for different settings of linear 
brightness control B and contrast control C. 
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An SiTF test is typically performed as follows. The square and its back- 
ground are set equal to the ambient temperature and the brightness and gain 
controls are adjusted to settings representative of some useful operating con- 
dition. A microphotometer is focused on the center of this square on the 
display and the luminance for a AT of zero degrees is recorded. The temp- 
erature of the square is subsequently changed in small increments and the 
luminance is plotted as a function of the temperature difference between the 
square and the background temperatures. The measurements are repeated for 
other control settings until the system's operation is sufficiently character- 
ized. A set of idealized SITFs for settings of linear brightness control B and 
contrast control C is shown in Figure 11.2. 

The dynamic range of a system is described by a temperature difference 
dynamic range and by a luminance dynamic range, both of which are functions 
of control settings and are extractable from SiTF. The dynamic range of AT 
is the maximum scene AT which is displayed unclipped for a particular con- 
dition of control settings. The maximum dynamic range of AT is the maxi- 
mum unclipped AT for the lowest gain setting. The luminance dynamic 
range gives the maximum luminance the system produces. 


11.2 Optical Transfer Function Tests 


Chapter Three defined the optical transfer function (OTF) for thermal 
imaging systems and presented the rationale for calculating the OTF from 
a measured line spread function (LSF). Figure 11.3 shows a simplified typical 
test scheme for generating.a blackbody line source and for measuring the LSF 
and OTF. Two alternatives are indicated: photometric slit scanning of the 
image of a fixed line source, and thermal line image scanning across a fixed 
photometric slit. 

The thermal line is generated by an approximate blackbody source such 
as a thermoelectric heat pump or an electrically heated plate which back- 
lights an opaque blackbody mask with a small slit milled or chemically-etched 
in it. The source and the mask must have the same spectral emissivity, and 
both must be controllable and uniform both in contact temperature and in 
emissivity to within about +0.05°C for measurement of good-quality thermal 
imagers. In the scheme of Figure 11.3 the plane of the slit mask is located at 
the focal plane of an infrared collimator, and the source is separated from the 
mask sufficiently to minimize radiative heat transfer between the source and 
mask. 

Ideally the apparent and contact temperatures of each surface should be 
the same, so that the source and mask temperature may be measured by 
embedded thermocouples, thermistors, or semiconductor diodes. This is 
desirable because small temperature differences may be more reliably and 
inexpensively measured by direct contact than by radiometry. The thermal 
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Figure 11.3 Schematic of a typical line spread function test set. 
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slit should be much smaller than the spatial resolution of the system, but not 
so small that a temperature difference sufficient to generate a good signal-to- 
noise ratio cannot be produced. The effective temperature difference of the 
slit is approximately the actual temperature difference times the ratio of the 
slit angular width to the system resolution. 

The collimator may be either refractive or reflective, but a high- 
resolution, broad spectrum lens is more easily and inexpensively fabricated as 
a reflector. A disadvantage of reflective collimators is that baffles must be 
used to avoid the sensor’s seeing defocused images of itself and of the 
laboratory retro-reflected in the lens. 

Before the OTF measurement begins, the OTF's of each of the measuring 
and recording devices must be known, and the thermal slit spatial frequency 
spectrum must be known. The devices which may have non-unity OTF’s are 
the collimator, the microphotometer, and the X-Y recorder. The non-unity 
spatial frequency spectrum of the line source has the effect of an OTF on the 
measured spread function, and this must be compensated for in the 
calculations. 

The first step in the OTF measurement is the focusing of the sensor on 
the collimator focal plane image, and the alignment of the sensor so that the 
slit is perpendicular to the direction in which the OTF is to be measured. If 
the system has an eyepiece, it should be adjusted to produce a collimated 
output. Then the displayed slit image must be focused onto the plane of the 
scanning slit photometer relay optic, and must be aligned parallel with the 
scanning slit. 

The second step is the determination of the experiment scale factor, 
that is, the conversion of millimeters in the scanning photometer slit plane 
to milliradians in object space. This allows a line spread function measured in 
millimeters to be converted to the desired object plane milliradian units. One 
procedure for measuring scale factor which minimizes error due to image 
spread is the introduction of a bar pattern of known angular period and the 
measurement and averaging of the separation between peak luminances on 
the display. 

The third step in the OTF measurement is the establishment of the OTF 
existence conditions given in Chapter Three, which are: linearity, spatial in- 
variance, and high signal-to-noise ratio. Presumably the spatial invariance 
requirement is satisfied by choosing a very small region on the display in 
which to make the measurement. However, it is exceedingly difficult to 
satisfy the linearity and low noise requirements simultaneously because 
virtually all systems are noisy, nonlinear, and have limited dynamic range. 
Thus a bright, high signal-to-noise ratio image is likely to have been pro- 
cessed nonlinearly, while a linearly processed image is likely to be noisy. It is 
therefore important to know the nature of the system’s signal transfer 
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function. An associated problem is that the photomultiplier used to measure 
the display luminance must be relatively noise-free and operated linearly. 
The essence of this third step is the selection of an effective temperature 
difference which gives a signal-to-noise ratio of around ten or better, and the 
selection of system control settings for which the SiTF is quasi-linear over 
the signal and noise range. 

The fourth step is the generation of the line spread function and dis- 
section of it with the moving photometric slit probe. This probe must be 
moved slowly so that the framing action of the system in combination with 
the probe motion does not undersample the LSF. This also allows narrow- 
band noise suppression circuits to be used without distorting the LSF. The 
LSF is commonly recorded on an X-Y plotter, although the more sophisticated 
automated techniques described later may be used. 

The LSF may be transformed approximately by fitting it to an LSF 
whose transform is known, such as a gaussian. Except in special cases, this is 
not sufficiently accurate, and the LSF must be transformed using a computer. 
This apparent OTF must then be compensated for measurement errors by 
dividing by the OTF of the measurement apparatus and by the line source 
spatial frequency spectrum. 

The OTF test procedure outlined is sufficiently time consuming and 
emotionally exhausting to warrant automating the data taking. Typically 
the investigator will perform the measurement several times until all the 
experimental problems are worked out and a satisfactory result is obtained, 
a process that can easily consume a full working day. Moulton, et al.,> and 
the author® have integrated OTF test equipments with digital control com- 
puters to reduce the aggravation of manual OTF. Figure 11.4 is a schematic 
of the essentials of such a computer controlled test set. The analog voltages 
representing the scanning slit position and the corresponding LSF amplitude 
are relayed to a computer-controlled dual channel analog-to-digital converter. 
The A-to-D converter digitizes the analog data and relays them to the com- 
puter for storage in memory. The computer performs a fast-Fourier-transform 
of the LSF and plots the LSF, MTF, and PTF ona plotter. An example of 
such a measurement was given earlier in Figure 3.10. 

There are many experimental errors which can cause results of an OTF 
test to be invalid. Sendall’ has pointed out that the high instantaneous 
display element luminance required to achieve an acceptable average display 
luminance in a fast-framing system may saturate the photomultiplier tube 
used. Thus care should be taken to insure that the photomultiplier has an 
adequate dynamic range. Brown® has indicated the necessity to view the 
LSF through a photometer entrance pupil of eye size when measuring sys- 
tems with eyepieces. This prevents lens aberrations not seen by the operator 
from yielding OTF’s inconsistent with visual experience. Another problem 
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with the OTF measurement is that it is difficult to assess the accuracy of the 
result. Perhaps the best way to recognize whether a given measurement is 
the "true" OTF is to compare the measurement with predictions and with 
the nature of the MRTD measured. OTF measurement techniques employed 
on visible spectrum systems are described in references 9 through 16. 


11.3 Minimum Resolvable Temperature Difference Test 


Chapter Five defined the minimum resolvable temperature difference 
(MRTD) as the equivalent blackbody temperature difference between the 
target of Figure 5.5 at a given frequency and its background such that the 
individual bars are resolvable by an observer. Care must be taken in devising 
test equipment and a test procedure to ensure that the MRTD existence 
conditions are satisfied. The system controls must be set, or varied between 
limits, such that the system is operated quasi-linearly, and so that noise is 
just visible on the display. In this way the system will be ‘‘noise-limited”’ 
rather than "contrast-limited."' The first condition сап be verified by referring 
to SiTF curves, while the second condition can be verified only by the 
Observer's subjective impression of the noise level. 

The test charts may be either finite conjugate sources located at some 
in-focus object distance, or they may be infinite conjugate sources simu- 
lated by a collimator. Finite conjugate sources have the disadvantage that 
low frequency patterns must be rather large. In that case uniformity and 
controllability of the targets will likely be a problem. The test procedure 
discussed here assumes that a collimator is used. 

A sufficient number of test patterns must be available to investigate the 
spatial frequency range of the system. The collimator focal plane must be 
determined and the targets located there. The target and background 
emissivities must be equal, uniform, and known. The temperature differential 
generating and sensing equipment must be calibrated. The target generator 
should be stable, controllable, and spatially uniform to within +0.01°C, and 
the accuracy of the measuring devices should be within +0.01°C. The col- 
limator should be optically baffled and the sensor oriented so that back 
reflections of the laboratory scene into the sensor are minimized. The spectral 
transfer and optical transfer functions of the collimator must have no detri- 
mental effect on the sensor. 

The observer or observers conducting the test must have good visual 
acuity and normal color vision. They should be thoroughly familiar with 
thermal imagery and be able to interpret noisy imagery. They should be 
dark-adapted to the average luminance level of the display, and the back- 
ground (laboratory) ambient luminance should approximately match the 
display luminance. Further, the observers should be trained to identify and 
to reproduce the conditions for which their confidence in recognition accu- 
racy is high. 
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A typical test procedure would progress as follows. The sensor of the 
system to be tested is mounted in the beam of the collimator, and the 
sensor is focused using an appropriate technique. The lowest frequency test 
target is introduced into the field at a position known to the observer, and at 
0°С AT. The AT is increased positively and at a rate slow enough to ensure 
target uniformity and measurement accuracy, and to allow the observer a 
sufficient number of decision times to make a valid decision. The observer 
then indicates the target AT at which he resolves the bars. The procedure is 
repeated for higher and higher frequencies until the observer is no longer 
able to resolve the bars at any AT. For target temperature differences low 
enough to assure linear system operation for both positive and negative con- 
trast targets (usually only low frequencies), negative AT’s may be used to 
find the negative MRT. This will indicate any temperature offset in the test 
set. 


11.4 Noise Equivalent Temperature Difference Test 


Citation of a measured NETD is virtually meaningless unless the noise- 
limiting electrical filter used in the measurement is also specified. It is desirable 
for standardization and comparison purposes to use as the noise filter a 
single — RC low pass filter whose 3-db frequency is the reciprocal of twice 
the dwelltime. Since electrical noise filtering in the system itself is usually 
minimal for good quality, this noise filter can frequently be external to the 
system and a part of the test equipment. 

To assure that all noise sources prior to the video display are accounted 
for in the measurement, it is customary to take the test signals from the final 
processing stage at which an analog video signal exists. This means prior to a 
multiplexer or pulse-width modulator. If this point is conveniently accessible, 
the conduct of the NETD test is trivial. One simply generates a large square 
blackbody thermal target whose angular dimension exceeds the system resolu- 
tion by several times and whose temperature exceeds the expected NETD by 
a factor of ten or so, and then measures the peak to peak signal and true 
RMS noise in the electronic image of that target. The NETD is then the 
target temperature difference divided by the peak signal to rms noise ratio. 
Care must be taken to assure that the system is operated linearly and that no 
noise sources are included which are not displayed, such as switching transi- 
ents, or reference signals which appear during the display blank time. Though 
it is a somewhat dubious figure of merit, NETD will long endure because its 
measurement does not require the skill and experience necessary for OTF 
and MRTD tests, so that virtually anyone can do it correctly. 
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CHAPTER TWELVE — THERMAL IMAGERY 


The quality of imagery which the current FLIR state-of-the-art can 
produce is exemplified by the eleven figures which follow. These photographs 
were produced by Sawyer! from a FLIR monitor using an exposure time of 
0.2 second to closely approximate the visual impression of noisiness in the 
real-time imagery. The particular FLIR used is a serial scan television- 
compatible system. Some of the imagery has superimposed on it a reticle, 
the square gate of an automatic target tracker, and date-time symbology. 


!L.H. Sawyer, personal communication, Honeywell Radiation Center, Lexington, Massachusetts. 
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Figure 12.1 Highway and bridge through a light industrial area. 
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Figure 12.2 Ап urban scene. 
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Figure 12.3 A concert shell. 
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Figure 12.4 A cargo helicopter. 
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Figure 12.6 A capitol building. 
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Figure 12.7 A sparsely vegetated area. 
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Figure 12.8 Freighters. 
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Figure 12.9 A power plant. 
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